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ABSTRACT 


With the advances in high speed, programmable digital signal processing (DSP) 
chips, modem communications links are using a combination of DSP techniques and digital 
communications methods to realize faster, reconfigurable, and modular systems. This 
thesis details the software implementation of a modem digital communication system 
combining various DSP functions, channel Forward Error Correcting (FEC) algorithms, 
and digital modulation methods. The digital modulation schemes considered here include 
both baseband and Quadrature Phase Shift Keying (QPSK) techniques. The proposed 
communication system will serve as a practical tool useful for simulating the transmission 
of any digital data. The various modules of the system include source encoders/decoders, 
data compression functions, channel encoders/decoders, and modulators/demodulators. 
Implementation consists of coding the various link functions in C and integrating them as 
a complete system. The results show the viability of a QPSK modulated digital 
communications link and point the direction of future research towards software radio. 
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1. INTRODUCTION 


This thesis details the implementation in software of a modern digital 
communication link, combining various Digital Signal Procesing (DSP) functions, 
Forward Error Correction (FEC) channel coding algorithms, and digital modulation 
methods, mainly Quadrature Phase Shift Keying (QPSK). Software implementation is 
performed in the C++ and C programming languages and consists of separately coding and 
interfacing the various functions of the system. By modularizing the various functions 
which are performed on the data from source to destination, it becomes convenient to 
change individual sections of the link and model the effects of different transmission 
conditions on data as it is passed through the channel. The modules of the link include 
source encoders/decoders, channel encoders/decoders, modulators/demodulators, and 
channel effects. 

The possible configurations of the link are numerous, so focus was maintained on 
a typical system that might be used in satellite communications. For this reason 
convolutional channel coders and QPSK modulation were chosen. Additionally, channel 
effects were primarily modeled as the combination of bandlimited additive white gaussian 
noise (AWGN) and signal attenuation. The results of this link on speech transmission show 
the various gains and tradeoffs that are realized as data passes through the entire system; 
the quantitative performance is measured by the probability of bit error, and the effect 
various signal-to-noise ratios have on this probability. 

The bulk of the actual implementation was performed in C and C++ while most of the 
algorithm testing and system design was accomplished using MATLAB. This required 
writing MATLAB scripts and converting them to C. While the bulk of the operational 
routines and algorithms are written using C constructs, all input/output and file accesses are 
accomplished with C++ notation; therefore, all of the code has been compiled with a C++ 
compiler. Full conversion to C is possible with appropriate changes to the file operations. 


1 



The thesis is organized in the following manner: Chapter II details the digital 
communication model used in this work. Chapter III discusses the DSP methods involved, 
specifically source encoding/compression standards used for speech. Chapter IV outlines 
the channel coding methods involving FEC and Viterbi algorithms while Chapter V 
presents the digital communications techniques used including the baseband and QPSK 
modulation schemes. Overall implementation and results are reported in Chapter VI. 
Conclusions and areas for further development are contained in Chapter VII. Appendices 
A through E include supporting code and documentation. 
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II. DIGITAL COMMUNICATION MODEL 


This chapter discusses the basic digital com m unication model which is the 
backbone of the work presented in the rest of this thesis. The model is broken into its 
constituent functions or modules, and each of these is in turn described in terms of its 
affects on the data and the system. Since this model comprises the entire system, both the 
source coding and communications techniques involved are briefly described. In chapters 
ni through VI, these two areas will be covered in detail, and the specific algorithms and 
methods used in the software implementation will be addressed in detail. 

A. SOURCE ENCODING AND DECODING 


The basic digital communication model is depicted in Figure 2.1 below. The first 
three blocks of the diagram (source encoder, channel encoder, and modulator) together 



Figure 2.1; Digital Communication Model 


comprise the transmitter. The source represents the message to be transmitted which 
includes speech, video, image, or text data among others. If the information has been 
acquired in analog form, it must be digitized prior to subsequent manipulation. This analog 
to digital conversion (ADC) is accomplished in the source encoder block. 

The last three blocks consisting of detector/demodulator, channel decoder, and 
source decoder form the receiver. The destination represents the client waiting for the 
information. This might include a human or a storage device or another processing station. 
In any case, the source decoder’s responsibility is to recover the information from the 


3 




channel decoder and to transform it into a form suitable for the destination. This 
transformation includes digital to analog conversion (DAC) if the destination is a human 
waiting to hear or view the infomation or if it is an analog storage device. If the destination 
is a digital storage device, the infomation will be kept in its digital state without DAC. [Ref. 
1 ] 

B. COMPRESSION 

The source encoder block has the additional task of compressing the data. The 
essence of digital data compression is to represent a given set of data in a minimum number 
of bits such that the original signal can be retrieved from this compact representation with 
minimal loss of information. This is achieved by eliminating redundancies that exist in the 
original digital signal. By representing the data with fewer bits, the burdens on storage 
devices are reduced and transmission requirements are lessened thereby freeing storage 
space and bandwidth for other data. The effectiveness of a compression algorithm is 
measured by its compression ratio, C, defined as 



where Z?,- is the number of bits used before compression, and is the number of bits used 
after compression. 

There are two types of compression algorithms; lossless and lossy. As the name 
implies, lossless compression methods will produce a representation from which the 
original signal can be recovered exactly with no loss of data; however, the corresponding 
compression ratios are not very high. Lossy compression methods achieve higher 
compression ratios but permanently lose some of the information in the original signal; 
therefore, the recreated signal is not an identical replica of the original. If the signal is 
destined directly for human consumption, this is not harmful. For many speech and image 
signals, the human ear and eye either compensate for the discrepancies or cannot identify 
them; however, lossy methods are generally not acceptable if, after recovery, the signal 
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needs to undergo further processing. The source decoder block performs signal 
reconstruction from the compressed data after it is received from the channel decoder. 

C. CHANNEL ENCODING AND DECODING 

One of the advantages of digital communications over analog communications is its 
robustness during transmission. Due to the two-state nature of binary data (i.e,. either a 1 
or a 0), it is not as susceptible to noise or distortion as analog data. While even the slightest 
noise will corrupt an analog signal, small amounts of noise will generally not be enough to 
change the state of a digital signal from 1 to 0 or vice versa and will in fact be ‘ignored’ at 
the receiver while the correct information is accurately recovered. 

Nevertheless, larger amounts of noise and interference can cause a signal to be. 
demodulated incorrectly resulting in a bitstream with errors at the destination. Unlike an 
analog system, a digital system can reduce the effect of noise by employing an error control 
mechanism which is used prior to modulation. The channel encoder performs this error 
control by systematically introducing redundancy into the information bitstream after it has 
been source encoded but prior to its transmission. This redundancy can then be used by the 
receiver to resolve errors that might occur during transmission due to noise or interference. 

The channel decoder performs the task of decoding the received coded bitstream by 
means of a decoding algorithm tailored for the encoding scheme. Error control of this 
variety that allows a receiver to resolve errors in a bitstream by decoding redundant 
information introduced at the transmitter is known as Forward Error Correction (FEC). The 
price paid for employing FEC is the increased bit rate and complexity of the transmitter and 
receiver. The specifics of the FEC encoding and decoding algorithms used will be 
expanded upon in Chapter IV. [Ref. 2] 

D. MODULATION 

The digital modulator serves as an interface between the transmitter and the 
channel. It serves the purpose of mapping the binary digital information it receives into 
waveforms compatible with the channel. In baseband modulation, the output waveforms 
are simple voltage pulses which take predefined values correponding to a 1 or 0. However, 
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many channels, such as a satellite channel, are not suited for baseband communication and 
require the incoming data to be modulated to a higher frequency, referred to as the carrier 
frequency, so it can be converted to an electromagnetic wave that will propogate through 
space to its destination (e.g., a satellite or a ground station). This type of modulation, known 
as bandpass modulation, varies one of the following three parameters of the carrier 
frequency based on the the incoming digital bitstream: amplitude, frequency or phase. 
These modulation types are commonly known as Amplitude Shift Keying (ASK), 
Frequency Shift Keying (FSK) and Phase Shift Keying (PSK), respectively. It is the last of 
these which is of primary interest and will be examined in more detail in Chapter V. 

The digital detector/demodulator reverses the process and extracts the binary 
baseband information from the received modulated signal which has been subjected to 
noise, interference, loss, and other distortions. The demodulator produces a sequence of 
binary values which are estimates of the transmitted data and passes it on to the channel 
decoder. [Refs. 2 and 3] 

E. CHANNEL EFFECTS 

During transmission, the signal undergoes various degrading and distortion effects 
as it passes through the medium from the transmitter to the receiver. This medium is 
commonly referred to as the channel. Channel effects include, but are not limited to, noise, 
interference, linear and non-linear distortion and attenuation. These effects are contributed 
by a wide variety of sources including solar radiation, weather and signals from adjacent 
channels. But many of the prominent effects originate from the components in the receiver. 
While many of the effects can be greatly reduced by good system design, careful choice of 
filter parameters, and coordination of frequency spectrum usage with other users, noise and 
attenuation generally cannot be avoided and are the largest contributors to signal distortion. 

In digital communication systems, a common quantity used to determinine whether 
a signal will be detected correctly is the ratio of energy per bit to spectral noise power 
density, Ef/Ng, measured at the detector. The higher the the lower the resulting bit error 
rate (BER), or the probability of bit error, Py. Unfortunately, a high demands greater 
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power consumption at the transmitter; in some cases, it may be unfeasible to obtain a high 
Ely due to transmitter size or power limitations as in the case of satellite transmission. Noise 
effects and further study of the parameter Ei/Ng will be covered in Chapters V and VI. [Ref. 
2 and 3] 

The digital communication system described consists of an ordered grouping of 
various modules which operate on an input data sequence. In practice, these modules or 
resources are not dedicated to a single source/destination, but they are shared by multiple 
sources and their destinations to achieve optimum utilization. In a digital system, the 
transmission bit rate is an important system resource. A given information source of 
bandwidth B, sampled at 2B samples/second using q bits per sample results in a data rate, 
R, of 2Bq bits per second. With a compression ratio C, the data rate from the source encoder 
is Rg = RIC bits per second. Channel coding by a factor n leads to a coded data rate of R^ = 
R/i bits per second; R^ is the system transmission bit rate. These bits are then used by the 
modulator to form the transmission waveforms which have to be accomodated within the 
available bandwidth. At the receiver these steps are performed in the reverse order to 
recover the information sequence. 
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III. SOURCE CODING METHODS 


In the digital communication system model described previously, the source encoder 
is responsible for producing the digital information which will be manipulated by the 
remainder of the system. After the digital signal is acquired from the analog information, 
the source encoder subjects it to a wide range of processing functions, the goals of which 
are to compactly represent the information. Speech, image, and textual information each 
have their own unique characteristics that require different source encoding techniques. 
Depending on the information source, different digital signal processing functions are 
implemented to remove the redundancies inherent in the given signal. The specifics of the 
speech compression techniques used in this thesis are detailed below. 

* 

A. SPEECH COMPRESSION 

Since the frequency content of spoken language is confined to frequencies under 
4000 Hz, it is reasonable to use a sampling frequency of 8000 Hz [Ref. 4]. Using 16 bit 
linear pulse code modulation (PCM) as the quantization method results in a bit rate of 128 
kbps. Subsequent analysis, coding, and compression of speech are performed on segments 
or frames of 20 to 30 ms duration. 

There are two broad categories of speech coding/compression. Both categories are 
concerned with representing the speech with the minimum number of applicable 
parameters while also allowing the speech to be intelligibly reproduced; both are lossy in 
nature. The first category deals with waveform coders which manipulate quantities in the 
speech signal’s frequency representation. Typical analysis tools of waveform coders are the 
discrete fourier transform (DFT) and the discrete wavelet transform (DWT), both of which 
transform the time signal to its frequency domain representation. In this case, compression 
might potentially be achieved by retaining the frequency components with the largest 
magnitudes. 

The second category of speech compression deals with voice coders, or vocoders 
for short. Vocoders attempt to represent speech as the output of a linear system driven by 
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either periodic or random excitation sequences as shown in Figure 3.1, 



Figure 3.1 Basic Model of a Vocoder 


A periodic impulse train or a white noise sequence, representing voiced or unvoiced 
speech, drives an all-pole digital filter to produce the speech output. The all-pole filter 
digital filter models the vocal tract. [Ref. 4] 

Additionally, estimates of the pitch period and gain parameters are necessary for 
accurate reproduction of the speech. Due to the slowly changing shape of the vocal trXct 
over time, vocoders successfully reproduce speech by modeling the vocal tract 
independently for each frame of speech and driving it by an estimate of a separate input 
excitation sequence for that frame. Most vocoders differ in performance principally based 
on their methods of estimating the excitation sequences. 

Linear predictive coding (LPC) or autoregressive (AR) modeling of speech is used 
to obtain the parameters of the all-pole model for a given frame of speech: 

sin) = Gu(n)-ais(n-l)-a2sin-2)-...-apSin-P) (3.1) 
where a- (i = 1, 2,..., P) are the coefficients of the the all pole filter, and P is the order of 

the filter. A widely used notation used to describe this AR model is given by LPC-P with P 
= 10 being a common choice as in the Federal Standard FS-1015 vocoder scheme. Below' 
a specific refinement of the LPC technique known as code-excited linear prediction 
(CELP) is detailed. 
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B. CODE-EXCITED LINEAR PREDICTION (CELP) 

Although the data rate of plain LPC coders is low, the speech reproduction, while . 
generally intelligible, has a metallic quality, and the vocoder artifacts are readily apparent 
in the unnatural characteristics of the sound. The reason for this is because this algorithm 
does not attempt to encode the excitation of the source with a high degree of accuracy. The 
CELP algorithm attempts to resolve this issue while still maintaining a low data rate. 

Speech frames in CELP are 30 ms in duration, corresponding to 240 samples per 
frame using a sampling frequency of 8000 Hz. They are further partitioned into four 7.5 ms 
subframes of 60 samples each. The bulk of the speech analysis/synthesis is performed over ‘ 
each subframe. 

The CELP algorithm uses two indexed codebooks and three lookup tables to access 
excitation sequences, gain parameters, and filter parameters. The two excitation sequences 
are scaled and summed to form the input excitation to a digital filter created from the LPC 
filter parameters. The codebooks consist of sequences which are each 60 samples long, 
corresponding to the length of a subframe. 

Figure 3.2 shows a schematic diagram of the CELP analyzer/coder. The stochastic 
codebook is fixed containing 512 zero-mean Gaussian sequences. The adaptive codebook 
has 256 sequences formed from the input sequences to the digital filter and updated every 
two subframes. A code from the stochastic codebook is scaled and summed with a gain- 
scaled code from the adaptive codebook. The result is used as the input excitation sequence 
to an LPC synthesis filter. The output of the filter is compared to the actual speech signal, 
and the weighted error between the two is compared to the weighted errors produced by 
using all of the other codewords in the two codebooks. The codebook indices of the two 
codewords (one each from the stochastic and adaptive codebooks), along with their 
respective gains, which minimize the error are then coded for transmission along with the .* 
synthesis filter (LPC) parameters. Because the coder passes each of the adaptive and 
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stochastic codewords through the synthesis filter before selecting the optimal codewords, 



1. Speech Synthesis 

Figure 3.3 provides a more detailed view of the CELP synthesizer. Indices 4 and 


identify the codewords used to form the input excitation sequence, and the gain parameters 
Gg and scale the codewords before their sum is passed to the linear prediction filter 
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which is formed with line spectrum pairs (LSPs). The coded CELP frame includes the 
indices of the codebook entries, their gain parameters, and the LSP parameters. 

The adaptive codebook entries are 60 samples long and are used to estimate the 
pitch period. The codebook consists of a 147 stage shift register. On odd subframes, the 
input excitation sequence used by the prediction filter is fed into the shift register, and the 
values on the other side are shifted out and discarded 60 samples at a time. The shift register 
values are used to form the codebook for use on the odd and the next even subframes; 
therefore, the synthesis filter excitation sequences used on even subframes are not sent to 
the register and are discarded. Of the total 256 sequences in the codebook, 128 represent 
integer delay which can be accessed directly from the shift register, and the remaining 128 
represent non-integer delay which are formed by interpolation from the integer delay 
sequences. [Ref. 5] 

As implied in the above description, synthesis of the speech is carried out with a 
different filter excitation sequence at every subframe. A coded CELP frame contains four 
sets of indices (one per subframe) corresponding to the codebook and gain values for each 
of its four subframes. During synthesis at the destination, the indices of the optimal 
codewords are used to reference the correct code sequences. Similarly, the indices of their 
respective gains are used to access the scaling parameters prior to use by the prediction 
filter. Both gain parameters are found in two separate fixed indexed codebooks of 32 entries 
each. [Ref. 5] 

2. Linear Prediction Filter 

The LPC prediction filter is a tenth-order filter derived using standard linear 
predictive techniques. A 30 ms Hamming window is used to window the data and the filter 

' 4 

coefficients are found by the autocorrelation method [Ref 7]. Instead of transmitting the 
filter coefficients or the pole locations of the prediction filter, the corresponding line 
spectrum pairs (LSP) of each of the poles are encoded and transmitted. The reason for this 
is that the LSPs provide efficient channel error resilient coding and can also be transmitted 
using fewer bits than the corresponding poles or filter coefficients [Ref 5]. The description 
on LSPs roughly follows the presentation given in Deller and Frerking [Refs. 4 and 5]. 
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The poles of the prediction filter, lM(z), are the roots of the polynomial 

N 

A{z) = 1 - X ^3-2) 

i = 1 

where are the LPC parameters. The idea behind LSPs is to use A(z) to construct two 
functions 

P(z) = (3.3) 

and 



whereis the sampling frequency (8000 Hz). Other than the real zeros at z = 1 for P(z) 

and at z = -1 for Q(z), the remaining N zeros each of P(z) and Q(z) are complex conjugate 
pairs which lie on the unit circle and can be represented by the real frequenciesand/^^, 
respectively, in equations (3.6) and (3.7). Furthermore, these complex conjugate pairs are 



interleaved as shown in Figure 3.4, so each zero of A(z) corresponds to a pair of zeros, one 



each from P(z) and Q(z). The closer a pair of zeros of P(z) and Q(z) are to each other, the 
closer the corresponding zero of A(z) is to the unit circle. 

To more accurately reflect the time changing nature of the vocal tract, the 30 ms 
frame over which the LPC parameters are computed is displaced from the 30 ms speech 
analysis frame which determines the input excitation parameters as shown in Figure 3.5. 



Figure 3.5 CELP Speech Analysis and LPC Analysis Frames 


Specifically, the current LPC frame straddles the last two subframes (15 ms) of the current 
analysis window and the first two subframes (15 ms) of the next analysis window. 

In the CELP algorithm, ten fixed sets of LSPs (corresponding to ten different 
prediction filters) are available in a lookup table during analysis and synthesis (see [Ref 5] 
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for the lookup table values), and the closest set is chosen for a given solution of the 
prediction filter. This is done for coding efficiency; however, once the values of the LSPs 
have been obtained from the lookup table, they are adjusted throughout the CELT analysis 
and synthesis routines. Just as the input excitation sequence, gain, pitch period, and pitch 
gain are adjusted every subframe, the LSPs are interpolated each subframe by different 
weighting factors provided in Table 3.1. For example, if LSPl of the previous frame is 420 
Hz and LSPl of the current frame is 280 Hz, then LSPl for subframe 1 (of the current 
frame) is: 7/8(420 Hz) + 1/8(280 Hz) = 402.5 Hz. LSP interpolation every subframe causes 
the corresponding prediction filter to change every 7.5 ms thereby producing a smoother 
representation of the changing filter characteristics of the vocal tract compared to the 
relatively abrupt characteristics produced by non-overlapped, uninterpolated 30 ms 
windows. 


TABLE 3.1: Interpolation Factors for LSPs 


Subframe 

Previous 

Frame 

Current Frame 

1 

7/8 

1/8 

2 

5/8 

3/8 

3 

3/8 

5/8 

4 

1/8 

7/8 


3. Speech Analysis 

The analysis stage of CELP comprises the synthesizer described above along with 
the weighting filter and error minimization function as shown previously in Figure 3.2. 
After a pair of adaptive and stochastic codewords have been scaled, summed, and filtered 
with the prediction filter, the result is compared to the actual speech. The difference 
between the two represents the error and is sent to the error weighting filter, also know as 
the perceptual weighting filter. This filter modifies the error signal by emphasizing the 
areas of the spectrum to which the human ear is most sensitive. After perceptual weighting 


16 


of the error, the filter output is squared and summed to yield the error energy for the given 
stochastic and adaptive codebook pair. 

The general procedure is to first pass every adaptive codeword (pitch period) and 
gain (pitch gain) through the synthesis filter to find the best scaled adaptive codeword. 
Once this has been found, each entry in the fixed codebook is filtered while carrying out a 
simultaneous search for the corresponding gain. [Ref 5] 

4. Compression Results 

The CELP coder produces a 144 bit word for each 30 ms analysis frame of speech. 
This results in a total bit rate of4800 bps or a compression ratio of approximately 27:1. The 
exact bit allocation per frame can be found in the federal standard [Ref 6]. 

Although CELP has a bit rate twice that of the LPC-10 algorithm, it provides much 
better intelligibility. The improvement in performance is due to the dynamic use of the 
gain-scaled stochastic and adaptive codebooks in determining the input excitation 
sequences to the linear prediction filter. However, this increased performance is at the cost 
of increased computational load. The exhaustive search of the two codebooks and gain 
value lookup tables, and the interpolation routines for the adaptive codebook and LSPs 
combine to produce approximately an order of magnitude increase in computation over 
LPC-10 [Ref. 4]. The Federal Standard [Ref. 6] allows for the adjustment of the codebook 
sizes and of the interpolation routines used for the adaptive codebooks to reduce 
computational overhead, but this comes at the cost of reduced accuracy in the reproduced 
speech. 

C. CELP VARIATIONS 

There are two important variations of the CELP method, namely, the low delay 
CELP (LD-CELP) and the vector sum excited linear prediction (VSELP). 

1. LD-CELP 

As the name implies, LD-CELP attempts to reduce the delay incurred by the heavy 
computational load of the standard CELP routine. The main differences of this method with 
the standard CELP are the addition of a tenth-order, backward-adaptive linear gain 
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predictor which scales the excitation sequence and the elimination of the pitch estimation 
process, thereby removing the adaptive codebook/gain functions. Without the adaptive 
codebook, LD-CELP uses a 50th order prediction filter to compensate for the loss of pitch 
information. Additional differences include more frequent updates of the LPC coefficients 
(every 2.5 ms), significantly shorter excitation sequences (5 samples versus 60), and 
smaller frame/subframe lengths (10 ms/2.5 ms). The result of these modifications is an 
output data rate of 16000 bps with one-way delay of approximately 2 ms compared to othet 
non-CELP based vocoders operating at the same data rate but with delays on the order of 
20-40 ms [Ref 4], For full details on the parameters of LD-CELP and its implementation, 
the reader is referred to the ITLF-T recommendation G.728. 

2. VSELP 

VSELP is an important variation of CELP because it is now the standard for North 
American digital cellular phone systems as defined in the IS-54 standard. The encoder uses 
the sum of three scale-adjusted sequences to form the input to a tenth-order synthesis filter. 
Each of the three codebooks contains 128 sequences that are 40 samples long. An analysis 
frame length of 20 ms is used, and the corresponding subframes are 5 ms each. Two of the 
codebooks are formed from two separate sets of seven basis codewords, and together they 
form an excitation sequence similar to that produced by the stochastic codebook in CELP; 
however, unlike CELP, the seven basis codewords of each codebook are optimized over a 
training database to minimize the perceptually weighted error [Ref 4]. The third codebook 
plays the role of the adaptive codebook in CELP and is formed from the summed sequence 
input to the synthesis filter, and during every subframe its entries are correlated to the actual 
speech signal to yield the best pitch estimate. The output data rate of VSELP is 8000 bps. 
[Ref 4] 

D. IMPLEMENTATION 

The standard CELP routine used in this thesis is based on FS-1016, and the code 
was developed by AT&T but available free as public domain software. It was modified by 
the author to compile and run on Sun SPARC workstations. The complete source code 
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listings and a compiled routine are now stored in the /tools_res directory of the ECE 
Department computer system. 

The compiled encoding routine accepts only 16-bit linear PCM speech sampled at 
4 = 8000 Hz and outputs the 144 bit long words in packed hexadecimal format (36 hex 
characters per frame). Since SPARC workstations process speech using the Sun ‘.au’ 
format which is 8 bit p -law compressed data, all speech files recorded on SPARCstations 
were converted to 16 bit linear PCM prior to usage by means of public domain conversion 
routines released by Sun Microsystems, Inc. Similarly, the decoding routine output was 
converted back to 8 bit p -law data. 

All intermediate processing from source encoder output to source decoder input 
was performed in binary, so the hexadecimal output from the CELP coder was converted 
to unpacked unipolar binary format (0 or 1) with the routine hex2bin.C, and the reverse 
process was accomplished with bin2hex.C. Both routines are included in Appendix A. 

The CELP algorithm provides an effective means of speech reproduction while 
maintaining a high compression ratio with a corresponding low bit rate. Due to the growing 
use of CELP based coding methods in commercial systems, it is useful to study these 
schemes. The CELP code is available as public domain software which enables a study of 
CELP and its performance in conjunction with a digital communication system. 
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IV, CHANNEL CODING 


Unlike source coding which seeks to remove redundant information in a signal, the 
goal of channel coding is the opposite in the sense that it adds redundancy to a bit stream 
which allows the detector at the receiver to detect and/or correct errors which might have 
been introduced during transmission. As mentioned in Chapter II, we are concerned 
specifically with FEC codes. 

A parameter used to define a FEC code is its code rate, r, where 



n 


where k represents the number of bits into the channel encoder, n is the number of bits out 
of the encoder, and k<n. The code rate, equivalently represented using the notation (n, k), 
is a measure of the information contained per codebit. Large values of n relative to k are 
better for error-free coding, but the cost is in increased transmission bandwidth require¬ 
ments. The two most common types of FEC codes encountered in communications systems 
are block codes and convolutional codes. [Ref. 1] 

For block codes, the encoder partitions a given input information sequence into k- 
bit words and represents each input word by an n-bit codeword, where the codeword is 
often the input word appended with n - k bits. If an input message sequence is partitioned 

into words that are k bits long, there are a total of 2^ possible information words and 2^ 
corresponding codewords. These codewords are designed to meet some predetermined 
error-correcting requirements and are fixed after they have been computed. Encoding is just 
a mapping of an information word to its respective codeword. At the decoder, the received 
codeword, which may have been altered due to noise, is mapped to the codeword with 
which it has the least difference followed by recovery of the corresponding information 
word. For more detailed information on block codes see Gibson [Ref. 1] and Roden [Ref. 
8 ]. 
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Convolutional codes form the second major category of FEC codes but are encoded 
and decoded in a very different manner than block codes. Today they are used in a variety 
of communication links, such as modern satellite communication systems. For this reason, 
the remainder of this chapter will deal exclusively with the encoding, decoding, and 
implementation of convolutional codes. The development will be limited to rate 1 /n codes 

with specific attention focused on r = 1/2 codes because they are a good representation 
of convolutional codes. 

A. CONVOLUTIONAL CODES 

For («,1) convolutional codes, each bit of the information sequence into the encoder 
results in an output of n bits. However, unlike block codes, the relationship between 
information bits and output bits is not a simple one-to-one mapping. In fact, each input 
information bit is ‘convolved’ with K-\ other information bits to form the output n-bit 
sequence. The value K is known as the constraint length of the code and is directly related 
to its encoding and decoding complexity as described below in a brief explanation of the 
encoding process. 

For each time step, an incoming bit is stored in a A" stage shift register, and bits at 
predetermined locations in the register are passed to n modulo-2 adders to yield the n output 
bits. Each input bit enters the first stage of the register, and the K bits already in the register 
are each shifted over one stage with the last bit being discarded from the last stage. The n 
output bits produced by the entry of each input bit have a dependency on the preceding K- 
1 bits. Similarly, since it is involved in the encoding of K -1 input bits in addition to itself, 
each input bit is encoded in nK output bits. It is in this relationship that convolutional 
coding derives its power. For larger values of K, the dependencies among the bits increase, 
and the ability to correct more errors rises correspondingly. But the complexity of the 
encoder and especially of the decoder also becomes greater. 


22 



Shown in Figure 4.1 is the schematic for a (2,1) encoder with constraint length K = 
3 which will serve as the model for the remainder of the development of convolutional 
coding. In the coder shown, the n = 2 output bits are formed by modulo-2 addition of the 
bits in stages one and three and the addition of bits in stages one, two, and three of the shift 



register. For example, consider the input sequence given by 

V = 11011.... 

If the left-most bit is the earliest in time and enters the shift register first, then assuming the 
register is empty (i.e., filled with zeros) at the beginning of the encoding process, the first 
two output bits, Mj j and mj 2 , produced by the coder by input bit vj = 1 would be 

Mj j = 1 © 0 = 1 

Mj2 = 1©0®0= 1. 

The © symbol represents modulo-2 addition. The rest of the output sequence is: 

u=10 1000 10... 

Each pair of output bits, Mj-j can be more conveniently written as codesymbol u,- 

It is also common to identify the coder by its code vectors. These length K vectors 
identify the bit locations in the shift regitster which are used by the n modulo-2 adders; 
therefore, there are n code vectors which represent the coding scheme. For the example 
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case, the code vectors are Cj = [1 0 1] representing stages one and three, and C 2 = [1 1 1] 
for all three stages. 

1. State Diagram 

To assist in analyzing the code and determining its output for a given input 
sequence, there are various representations of convolutional codes which are helpful. These 
include polynomial representations, state tables, code tree, state diagrams, and trellis 
diagrams. All of these contain the same information but they each provide different insights 
into the workings of the code. For detailed information on all of the descriptions and their 
mathematical foundations, see Sklar [Ref. 2]. We will briefly focus on state and trellis 
diagrams. 


Figure 4.2 shows the state diagram for the coder introduced in the previous section. 



There are four states associated with the coder and represented by the state bubbles lettered 
A through D. Each state is uniquely identified by the two bits in the first two stages of the 

shift register; since the input data is binary, there are 2^ states. In general, for an (n,l) con¬ 
volutional coder of constraint length K, there are 2^"' states. The state transition paths show 
the state changes that occur for a given input bit, Vp along with the correponding output bits. 


24 







produced during the transition. Given the initial state of the coder, the coded output 
for an input data sequence can be easily determined by moving from state bubble to state 
bubble along the appropriate state transition path for each input bit (either 0 or 1) and read¬ 
ing the associated output bits. The state diagram representation can also be put in tabular 
form which is more manageable as K gets larger. 

2. Trellis Diagram 

Another insightful representation of convolutional codes is the trellis diagram. The 
trellis diagram in Figure 4.3 can be quickly obtained from the state diagram shown above. 



- Input bit 0 

-Input bit 1 

Figure 4.3 Trellis Diagram of K=2>, r = 1/2 Encoder. After Ref [2] 


In the trellis representation, states A through D are represented by four nodes and all of the 
possible state transitions are shown beginning at some initial state (which is at A = 00 in 
this case). As a new bit is input to the coder at each time sample, n,-, the trellis depicts the 
possible progression of the codeword from state to state where solid lines represent an input 
bit of 1 and dashed lines indicate a 0 input bit. In other words, given the initial state of the 
coder. Figure 4.3 shows all of the possible paths through the trellis corresponding to all of 
the possible input sequences. It is clear that a repetetive pattern emerges a few time steps 
from the initial condition, and it continues forever. The pattern is merely the ‘unwrapped’ 
state transition diagram of Figure 4.2 where each state transition path between state bubbles 
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in Figure 4.1 is replaced by a trellis branch between the corresponding nodes. The trellis 
representation is a very important tool when implementing the Viterbi decoding algorithm 
to be described later. 

3. Systematic vs. Nonsystematic Codes 

A code is systematic if the encoder forms the output codeword by appending n - k 
bits to the input information word. In the case of block codes, any non-systematic code can 
be reduced to a systematic form and still keep its designed error correcting abilities. The 
performance of convolutional codes depends on its free distance, df The dfis a measure of 
the code’s error correcting capability such that a code with a larger djVjiW be able to correct 
more errors. The number of errors a code can correct is given by 



where the [.xj operation indicates the integer closest to but not greater than the argument 
X [Ref. 2]. For convolutional codes, the non-systematic version will generally have a larger 
df than its systematic version and will be able to correct more errors. In the case of the K = 
3, r = 1/2 coder, the codevectors Cj and C 2 listed earlier are the non-systematic code vectors 
which yield a t^of 5 and t = 2. More detailed information concerning systematic codes and 
calculations of free distance can be found in Sklar [Ref 2] and Riccharia [Ref 9]. 

B. VITERBI ALGORITHM 

In general, there can be errors in the received data stream. For the correct code 
sequence to be recovered, these errors need to be resolved. Channel decoding is performed 
at the receiver after demodulation and prior to source decoding to attempt to resolve these 
errors. Convolutional decoding involves uncovering the transmitted message sequence by 
using the inherent coupling of information between bits introduced at the encoder to 
remove the errors. Because the size of the transmitted code sequence is unknown to the 
receiver, it must allocate enough memory resources to store and decode large sequences. If 
the initial state of the coder is not known, the decoder must also keep track of all of the 
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possible code sequences in order to perform an accurate decoding operation. The best 
decoding schemes attempt to quickly locate and update the most likely sequences while 
quickly discarding the least likely ones. Two of the more commonly used decoding 
algorithms are the sequential decoding and Viterbi decoding algorithms of which only the 
Viterbi method will be discussed here [Refs. 2 and 7]. 

1. Maximum Likelihood Decoding 

If every output data sequence produced by the channel encoder is equally likely and 
if the probability of bit error, Pf,, is the same and independent for each transmitted 0 or 1 in 
the sequence, then a maximum likelihood decoding procedure can be used by the channel 
decoder [Ref. 2]. The decoding procedure involves finding the channel encoder output 

sequence, 1/”*“, which satisfies the following relationship: 

= majc|p(Uj.|l7'”)|, for all t/” (4.3) 

where Uj. is the received code sequence, I/® is one of the possible message sequences, and 
represents the conditional probability of receiving the code sequence given 

the sequence Since the solution involves a complete search over all possible message 
code sequences, the answer represents the most likely transmitted sequence. The condition¬ 
al probability for each message sequence can also be calculated based on the conditional 
probability of occurence of each bit in given the bit for that time step in the correponding 
message sequence, 

OO 

/’(Ur If/”) = (4-4) 

i = 1 

where u^ and C/, are the ith bits in the received sequence and the reference message se¬ 
quence, respectively. Sometimes it is useful to take the logarithm of this result in order to 
simplify the calculation to a series of addition operations. See Therrien [Ref 7] for more 
details. 
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2. 


Viterbi Decoding Algorithm 


The biggest practical constraint imposed by the maximum likelihood decoding 
method is that for long message sequences produced by large constraint coders, the amount 
of memory and time required to store and caleulate all of the conditional probabilities for 
all of the sequences becomes prohibitive and in practice it is unrealizable. The Viterbi 
algorithm uses the method of maximum likelihood decoding with a few modifications to 
reduce this workload. 


a. Preliminaries 

Before explaining the Viterbi procedure, two quantities need to be defined; 
the Hamming weight and the Hamming distance. The Hamming weight, of a binary 

sequence is equal to the number of Is in the sequence. For example, the codeword Uj = [1 
0 01 10] has = 3. A related quantity is Hamming distance, d^, which is the number 

of bit by bit differences between any two binary words of the same length. For the 
codewords Uj above and U 2 = [l 1010 0], <i//(uj, U 2 ) = 2 since they differ in a total of two 

bit locations (at positions 2 and 5). The Hamming distance and Hamming weight are related 
as follows: 

w^(Ui © U 2 ) = djjivL^, U 2 ) (4.5) 

where the © symbol once again indicates modulo-2 addition. Using the two codewords 
above, 

"1 ® "2 = [010010]. 

and using equation (4.5), 

W^(Uj©U2) = 2 = dfj(u^,U2). 

b. Decoding Procedure 

Instead of using the conditional probabilities of the received code sequence 
as the decision criterion, the Viterbi algorithm uses the metric defined by the Hamming 
distance to determine the most likely transmitted code sequence. Specifically, the code 
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sequence with the smallest total Hamming distance from the received sequence is the most 
likely transmitted message. 

The Viterbi procedure can be more easily understood by referring to the 
trellis representation of Figure 4.4, with specific attention on the repetetive pattern which 
emerges at step 03 . The important feature to notice is that each node has two entry branches 
and that these remain fixed. These branches are denoted as iyj and iji (where 

(i, 7 T,; 2 )e {A,B, C,Z)}) whereyl andy^ are the source nodes which lead to node i. Each 
branch is associated with the output code symbol resulting with a transition from state j to 
state i. For example, and are the two branches which originate from nodes A and C 
and lead to node A with Aa= 00 and Ac=l 1. As the received code sequence Uj. arrives, it is 
parsed into code symbols of length two bits, u^/ (/ = 0 , 1 , 2 ,...), and these code symbols are 
fed to the decoder. At iteration step I, the Hamming distance between code symbol u^/ and 
each of the 8 branches is calculated resulting in two Hamming distances per node. For node 
A, these are and for example. Each node also has an associated total 

path metric, This is the Hamming distance between the entire path in the trellis which 
leads to node i at step /, ipathi^), and the received code sequence u^C/) as of step /, 

p/(i) = df^{}x^{l),ipathil)) (4.6) 

computed at the end of step 1. The two Hamming distances per node are summed with the 
respective total path metrics of the source nodes calculated during the previous iteration: 

+/’/-! (/I) 

and 

+Pz-l(/2). 

The total path metric for node i at step I is the smaller of the two values, 

Pid) = rmn{dH{u^i,iji) + Pm (/I), dfi(u^i,ij2) + Pm(/2)} (4.7) 

In this way, the path which corresponds to the lesser total distance is kept while the other 
is discarded. After a number of codesymbols of Uj. have been processed, the nodes in all of 
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the sequences some y steps behind the current iteration will converge to the same path 
for all of the states such that at step /, 

P/-y(A) = Pi.yi^) = Pi-yiC) = Pi.y(D). (4.8) 

This common path is part of the desired sequence and can be easily decoded to the original 
bit sequence. The process continues until only one path is left through the entire trellis and 
the entire sequence has been decoded. This bit pattern is the most likely transmitted infor¬ 
mation sequence and is subsequently fed to the source decoder for final processing. 

c. Hard vs. Soft Decisions 

So far in this section, the assumption is that the input to the decoder is either 
a 0 or a 1 bit, correponding to some defined two-level voltage assignment. This method is 
called hard decision coding because the detector preceding the channel decoder determines 
whether or not the received bit from the channel is a 0 or a 1. There is another 
implementation in which the detector outputs a multilevel voltage and the channel decoder 
bases its output on these inputs. This soft decision decoding procedure has been shown to 
provide a 2 - 3 dB coding gain over the hard decision process but comes with the price of 
increased receiver and channel decoder complexity [Ref. 10]. In this thesis only hard 
decision decoding was used. 

3. Memory Requirements and Computational Load 

It is apparent that the repetetive Hamming distance calculations and trellis path 
comparisons can lead to a loss in speed of operation of the algorithm. Furthermore, 
although the number of trellis branches that need to be stored decreases whenever a new 
minimized path is found, this does not necessarily happen at every iteration. There are a 
few factors which must be weighed against each other in order to optimize performance. 
These are the frequency of path metric calculations and branch comparisons, the desired 
speed of operation, the amount of branch history that needs to be saved, and memory 
limitations. 

While frequent calculations will reduce memory requirements and yield output data 
at a more constant rate, the time taken to perform the calculations and update the path 
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histories will slow down the overall execution time. For larger constraint length coders, 
since more bits are used to encode the information, it is not necessarily productive to 
perform frequent calculations since the decoder will require more received bits to resolve 
any errors through the trellis. In other words, the decoder might end up doing all of the 
calculations yet not produce any output. 

For rate Hn, constraint length K coders, there are 2^'^ paths that must be stored. The 
amount of path history that is retained for these paths is also critical to the decoder’s 
performance. If too much path history is kept, then large amounts of memory are used. 
Coupled with infrequent calculations and trellis updates, the computational overhead 
becomes massive during each step when calculations are performed. On the other hand, the 
probability of resolving that part of the sequence correctly increases because there is more 
information with which to work. Obviously, there needs to be a compromise in order to 
minimize demands on memory and computational load while maximizing decoding 
efficiency. 

It has been shown that the total amount of storage required by the decoder to 
maintain all of the state histories, so the input can be corrrectly recovered is 

s = h2^~\ (4.9) ^ 

where s is the total trellis branch storage requirement, h is the length of the information path 

for each state, and 2^'^ are the number of states. The value of h is generally betweeen 4 to 
5 times the constraint length [Ref. 2]. 

C. IMPLEMENTATION 

The source code for the channel encoders and decoders used in this thesis are given 
in Appendix B. The details of the codes used and their implementation are given below. 

1. Encoders 

In today’s communication systems, there are a variety of convolutional coders in 
use. Two commonly used varieties are the AT = 5 and the AT = 7, r = 1/2 coders. The 
parameters of these codes are given in Table 4.1. The code vectors are each for the non- 
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systematic versions yielding the listed free distances. Coding was straightforward and the 
code for the more involve K=1 coder is provided as file K7_enc.C in Appendix B. Both 
the input and output data are in unpacked unipolar binary format. The encoder will take any 
binary data in unipolar format and output the coded sequence. Additionally, after the last 
bit enters the coder, it is flushed through the shift register with six zeros to fully involve it 
in the encoding process. 


TABLE 4.1; Convolutional Code Parameters 


K 

code vectors 

free 

distance 

t 

5 

10111 

11001 

7 

3 

7 

1001111 

1101101 

10 

4 


2. Viterbi Decoder 

The following information is specific to the = 7 decoder. Similar results hold for 

the less complicated K=5 decoder. For aisT = 7 code, there are 2^'^ = 64 states. The states 
and their associated Hamming weights were found using MATLAB. Both the states and 
Hamming weights are coded in a lookup table included in header files. The path history, h, 
for each state is limited to 40 which is slightly larger than suggested in the preceding 
section, but it is used to ensure near optimal performance. 

When the decoding process begins, the algorithm proceeds without any knowledge 
of the initial state of the encoder. This makes decoding a more generalized procedure in th'e 
event the shift register of the coder was not flushed with zeros prior to encoding or if 
multiple data streams are encoded back to back. After every two bits enter the decoder, the 
128 Hamming distances (two per node) appropriate for that input pair are accessed from a 
lookup table and summed with the total path metrics of the corresponding source nodes 
from the previous iteration. This yields 128 paths through the trellis of which only half are 
kept. This add, compare, and select routine is performed at every iteration. 
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All 64 paths are checked against each other every 15 iterations to determine whether 
their ipath^l) sequences are the same. This corresponds to 30 input bits. When all the 
common nodes are found, they are decoded and the outputs are written to a file. These 
nodes are then flushed from memory, the path histories are all reset with new starting points 
at the most recent common source node. This way, data is routinely being output, and the 
memory requirements are minimized. 

There is a chance that, at the tail end of the incoming message sequence, there might 
not be enough coded bits to fully resolve the message sequence. This results in a minimal 
bit loss on the order of 15 - 20 bits. When compared to the total bit length of the message 
sequence, this is minor; however, this problem can be averted by appending more encoded 
bits to the sequence (i.e., flushing more zeros through the coder) thereby providing the 
decoder with more information. 

The decoder routine is under the filename Viterbi_7.C in Appendix B. 
Additionally, the header files vit7_consts.h, vit7hamwts.h, and vit_functions.h contain 
lookup tables and function routines used to compare, select, update, and decode paths 
through the trellis. The code for the K=5 decoder is similar to that given for the K-1 coder 
with some simplifications and changes in decoding parameters. 

FEC codes provide a method of error correction in a transmitted data sequence. 
Although they increase the data rate in a system, the benefits of FECs under noisy 
conditions outweigh this inherent characteristic. Convolutional codes and Viterbi decoding 
are widely used methods of implementing FECs. The computer code contained in 
Appendix B is an implementation of a widely used convolutional code and suited for 
channel simulations. 
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V. DIGITAL MODULATION 


In digital transmission systems, the data sequence from the channel encoder is 
partitioned into L bit words, and each word is mapped to one of M corresponding 

waveforms according to some predetermined rule, where M = 2^. For example, in the case 
of baseband modulation, the words are L = 1 bits each where each bit correponds to one of 
two different voltage levels. As we shall see later, in a QPSK modulation system, the 

incoming sequence is separated into words of L = 2 bits each and mapped to M = 2 =4 
different waveforms. During transmission, the channel causes attenuation and introduces 
noise to the signal. The net result is the formation of a version of the original signal which 
may not be detected correctly by the receiver. If the errors are too numerous, the channel 
decoder may not be able to resolve the information correctly. Baseband modulation using 
the simple binary symmetric channel model is briefly discussed, and the details of QPSK 
modulation are then presented. 

A. BASEBAND MODULATION 

Baseband modulation is the process of transmitting each bit in a sequence by means 
of a voltage pulse. The bit information is either contained in an assigned voltage level or 
in the transition between two different voltage level. In this thesis, the two baseband 
representations used are the unipolar and the bipolar formats. Unipolar format assigns a 
high voltage level for seconds to one of the bits and a zero voltage level of the same 
duration to the other. The value is the bit duration time and is the reciprocal of the data 
rate, R. For the bipolar format, the bits are assigned voltages of the same magnitude but of 
opposite sign. Usually, the 1 bit is assigned the higher voltage and the 0 bit is assigned the 
lower value. Further information on baseband representation is available in [Ref. 11]. 

1. Binary Symmetric Channel 

The binary symmetric channel (BSC) is the simplest model for a channel, and it is 
a special case of the discrete memory less channel (DMC). The inputs and outputs of a 
DMC come from finite discrete alphabets where each output depends on a set of 
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conditional probabilities given each input. In the case of a BSC, the input and output 
alphabets are both the same, namely {0,1}, and the ouputs are described by the conditional 
probabilities P(OIO), P(Oll), P(IIO), and P(lll). Because the channel is symmetric, the 
following relationships hold: 


P(011) = P(1I0) = P^ 

(5.1) 

P(0I0) = P(1I1)= 1 - p^. 

(5.2) 


A conceptual picture of a BSC is shown in Figure 5.1. Although it is easy to associate the 
BSC with a baseband modulation scheme, the waveform representation of the data se¬ 
quence is irrelevant because the BSC models the channel effects on the underlying binary 
data sequence regardless of the modulation method. The BSC is very useful for strictly 
modeling the effects of on the fidelity of the information sequence after it has been 
source decoded. Similarly, it can be used to test the effectiveness of the channel encoding 
and decoding routines. Both of these tasks were accomplished by coding a BSC and send¬ 
ing data across it. For additional reading on DMC and BSC, see Couch [Ref. 11]. 



Figure 5.1 Binary Symmetric Channel 


The coded BSC model using baseband modulation is contained in Appendix C as 
baseband.C. Upon execution, a user supplied is applied to a data sequence, and each 
bit is changed based on that probability, thereby simulating a bit error. 
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B. QPSK MODULATION 

In this section the topics of QPSK modulation of digital signals including their 
transmission, demodulation, and detection, are developed. The material in this section and 
the related coding of this system are both based on transmission using an AWGN channel 
model which is covered at the end of this section. Some of the techniques discussed below 
are specifically designed for robustness under these conditions. 

Because this is a digital implementation of a digital system, it is important to note 
that the only places where analog quantities occur are after the DAC, prior to the actual 
transmission of the signal, and before the ADC at the receiver. All signal values between 
the source encoder input and modulator output are purely digital. This also holds for all 
quantities between the demodulator and the source decoder. 

1. Background 

QPSK modulation is a specific example of the more general M-ary PSK. For M-ary 
PSK, M different binary words of length L = log 2 M bits are assigned to M different 
waveforms. The waveforms are at the same frequency but separated by multiples of cp = 
271/M in phase from each other and can be represented as follows: 

s^{n) = cos(^27C^n + , <P, = ^ (5-3) 

with i= 1,2,... M. The carrier frequency and sampling frequency are denoted by and 
/j, respectively. 

Since an M-ary PSK system uses L bits to generate a waveform for transmission, its 
symbol or baud rate is 1/L rimes its bit rate. For QPSK, there are M = 4 waveforms 
separated by multiples of (p = lUl radians and assigned to four binary words of length L = 
2 bits. Because QPSK requires two incoming bits before it can generate a waveform, its 
symbol or baud rate, D, is one-half of its bit rate, R. 
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2 . 


Transmitter 


Figure 5.2 illustrates the method of QPSK generation. The first step in the 
formation of a QPSK signal is the separation of the incoming binary data sequence, b, into 
an in-phase bitstream, bj, and a quadrature phase bitstream, bq, as follows. If the incoming 

data is given hyb = bg, bj, b 2 , b 3 , b 4 ,... where b,- are the individual bits in the sequence, then 
bj = bo, b 2 , b 4 ,... (even bits of b) and = bj, bg, b 5 ,... (odd bits). The digital QPSK signal 
is created by summing a cosine function modulated with the bj stream and a sine function 
modulated by the bq stream. Both sinusoids osciallate at the same digital frequency, oo^ = 
radians. The QPSK signal is subsequently filtered by a bandpass filter, which will 
be described later, and sent to a DAC before it is finally transmitted by a power amplifier. 
Details on the DAC and subsequent transmission operations can be found in Frerking [Ref. 
5] and Freeman [Ref. 10]. 



a. Signal Constellation 

It is often helpful to represent the modulation technique with its signal space 
representation in the I-Q plane as shown in Figure 5.3. The two axes, I and Q, represent 
the two orthogonal sinusoidal components, cosine and sine, respectively, which are added 
together to form the QPSK signal as shown in Figure 5.2. The four points in the plane 
represent the four possible QPSK waveforms and are separated by multiples of 71/2 radians 
from each other. By each signal point is located the input bit pair which produces the 
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respective waveform. The actual 7 and Q coordinates of each bit pair are the contributions 
of the respective sinusoid to the waveform. For example, the input bits (0,1) in the second 
quadrant correspond to the {I,Q) coordinates, (-1,1). This yields the output waveform 
-1 + Q = - cos(a)^n) -i- sin((0^n). Because all of the waveforms of a QPSK have the 

same ampilitude, all four points are equidistant from the origin. Although the two basis 
sinusoids shown in Figure 5.2 are given by cos(cOon) and sin(a)on), the sinusoids can be any 
two functions that are orthogonal. 



b. Filtering 

The QPSK signal created by the addition of the two sinusoids has significant 
energy in frequencies above and below the carrier frequency. This is due to the frequency 
contributions incurred during transitions between symbols which are either 90 degrees or 
180 degrees out of phase with each other. It is common to limit the out of band power by 
using a digital bandpass filter (BPF) centered at cOq. The filter has a flat passband and a 

bandwidth which is 1.2 to 2 times the symbol rate [Ref. 5]. ^ 

>• 

■■T 

3. Receiver 

The receiver’s function consists of two steps: demodulation and detection. 
Demodulation entails separating the received signal into its constituent components. For a 
QPSK signal, these are the cosine and sine waveforms carrying the bit information. 
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Detection is the process of determining the sequence of ones and zeros those sinusoids 
represent. 

a. Demodulator 

The demodulation procedure is illustrated below in Figure 5.4. The first 


step is to multiply the incoming signal by locally generated sinusoids. Since the incoming 



Figure 5.4 QPSK Demodulator and Detector 


signal is a sum of sinusoids, and the receiver is a linear system, the processing of the signal 
can be treated individually for both components and summed upon completion. Assuming 
the received signal is of the form 

r{n) = A^cos(co^n) + AgSin((0^n) , (5.4) 

where A/ and Ag are scaled versions of the bj and bg bitstreams used to modulate the signal 
at the transmitter. The contributions through the upper and lower arms of the demodulator 
due to the cos(©^«) input alone are 

r^i(n) = A^cos(©^n)cos(( 0 „n + 0) (5.5) 

and 

r^g(n) = A^cos(to^n)sin(( 0 ^n + 0) (5.6) 

where 0 is the phase difference between the incoming signal and locally generated sinuso¬ 
ids. These equations can be expanded using trigonometric identities to yield 




r^j{n) = 0.5A/[cos(2co^n + 0) + cos(-0)] 


(5.7) 


and 



r^g(n) = 0.5Ay[sin(2co^« + 0) - sin(-0)]. 

(5.8) 

The LPFs remove both high frequency terms leaving only the terms , 



d^j{n) = O.5A;COs(-0) = O.5A^cos(0) 

(5.9) 

and 

dcQ{n) = (-O.5)AySin(-0) = O.5A;sin(0). 

(5.10) 


These terms are proportional to the phase difference between the incoming signal and the 
locally generated sinusoid. Using a similar development for the sin(a)^n) portion of the 
input r{n), the outputs of the upper and lower arms of the demodulator are 


d^i{n) = (-O.5)AgSin(0) (5.11) 

d^gin) = O.5AgCOs(0) (5.12) 

The demodulated bitstreams dj(n) and <ig(n) are found by summing the signal components 
in (5.9) and (5.11), 

djin) = O.5A;COS(0) + (-O.5)AgSin(0), (5.13) 

and (5.10) and (5.12), 

dgin) = O.5A;Sin(0) + O.5AgCos(0). (5.14) 


For 0 = 0, equation (5.13) produces a scaled version of the in-phase bitstream while equa¬ 
tion (5.14) is the quadrature phase component. The-goal of driving 0 to zero is realized with 
a phase locked loop (PLL). 

b. Synchronization 

For the received data to be detected and interpreted correctly, there needs to 
be coordination between the receiver and the transmitter. Since they are not physically 
connected, the receiver has no direct means of knowing the ‘state’ of the transmitter. This 
state includes the both the phase argument of the modulator and the bit timing of the 
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transmitted data sequence. The receiver must therefore extract the desired information 
from the received digital signal to achieve synchronization. 

A common means of accomplishing synchronization is with a PLL as 
shown in Figure 5.5. The inputs to the PLL are the outputs from the two LPFs in Figure 
5.4. As shown in equations (5.13) and (5.14), the outputs of both LPFs are a function of 
the phase difference, 0, between r{n) and the sinusoids. The two demodulated data streams 



Figure 5.5 Phase Locked Loop for QPSK Synchronization After Ref. [5] 

are each multiplied by the sign of the other stream, using the limiters, 

e^{n) = sign{dQ{n))dj{n) (5.15) 

e^{n) = sign{d,{n))Q{n), (5.16) 

and subtracted to yield the error signal input, e^{n), to the loop filter: 

e^{n) = sign{dQ{n))di{n)-sign{d,{n))Q{n). (5.17) 

The loop filter is an FIR low pass filter given by [Ref. 5] 

N= 9 

yin) = (0.8)'jf(n-0 (5.18) 

( = 0 

where the output y{n) depends on a weighted average of its previous inputs, x{n). The out¬ 
put is sent as the phase argument to the voltage controlled oscillator (vco) which produces 
the two sinusoid outputs used to demodulate the received signal, r{n). 
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c. Detection 

After the signal r{n) has been demodulated into the bitstreams di{n) and 

the corresponding bit information must be recovered. The commonly used 
technique is to use a matched filter at the output of each LPF as shown in Figure 5.4. The 
matched filter is an optimum receiver under AWGN channel conditions and is designed to 
produce a maximum output when the input signal is a mirror image of the impulse response 
of the filter. The outputs of the two matched filters are the detected bitstreams b^j and b^q, 
and they are recombined to form the received data bitstream. The development of the 
matched filter and its statistical properties as an optimum receiver under AWGN conditions 
can be found in various texts [Refs. 3 and 7]. 

4. AWGN Channel 

The previously introduced BSC channel modeled all of the channel effects with one 
parameter, namely the BER; however, this model is not very useful when attempting to 
more accurately model a communication system’s behavior. The biggest drawback is the 
lack of emphasis given to the noise which significantly corrupts all systems. 

The most commonly used channel model to deal with this noise is the additive white 
Gaussian noise (AWGN) channel model. The name results because the noise is simply 
added to the signal while the term ‘white’ is used because the frequency content is equal 
across the entire spectrum. In reality, this type of noise does not exist and is confined to a 
finite spectrum, but it is sufficiently useful for systems whose bandwidths are small when 
compared to the noise power spectrum. When modeling a system across an AWGN 
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channel, the noise must first be filtered to the channel bandwidth prior to addition. An 
illustration is shown in Figure 5.6. 



Figure 5.6 Application of AWGN Channel 


C. IMPLEMENTATION 

All code for this chapter is contained in Appendix C. There are a variety of LPFs 
and BPFs used in the routines, and the coefficients of these filters are documented in the 
code. All filters were designed using the MATLAB functions butter, 2 et\d filter in the 
Signal Processing Toolbox [Ref. 12]. All of the repeatedly used values such as cos, sin, 
and filter coefficients are retrieved from lookup tables to reduce computational load. The 
entire QPSK system is coded as one function to reduce execution time. Specifically, even 
though the inputs to the modulator and the outputs from the detectors are binary, all of the 
intermediate values are in floating point. By combining all of the functions, the large 
amounts of time used in processing the floating point values and the associated file input/ 
outputs are reduced. The files used are qpsk.C, qpskmod.h, demod.h, bpf.h, and noise^h 
which are included in Appendix C. 

1. Modulator 

The modulator uses the output from the channel encoder to create the QPSK signal. 
Because the output of the channel encoder is in unipolar form, it is first converted to bipolar 
form with all 0 bits changed to the value -1. The modulated frequency is 455 kHz which 
is commonly used as an intermediate frequency in superheterodyne receivers. All 
calculations were performed at this frequency, and the modulated frequency will be 
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denoted as/^ even though it is not the carrier frequency. The sampling frequency,/^ , = 8/^ 
= 3640 kilosamples/second, and the sample time is, = l/f^ 

Using CELP coded speech, the output bit rate of the convolutional coder isR = 9600 
bps. Using QPSK modulation which needs L = 2 bits to generate a waveform results in a 
symbol rate, D, of 4800 baud; therefore, the number of samples per baud, iV^, is given by 


■f 

N,, = roundl ^ 1. 


(5.19) 


Given the input bitstream b, every seconds, the bits Z?,- (in-phase bit) and (quadrature 
phase bit) are multiplied by a cosine and a sine function, respectively, and summed. After 
A), samples, the bits bi+T, are used and so on until the entire bitstream is used. For 

the given and D, N/, = 758 samples/symbol. 

Both sinusoids are obtained from a lookup table of 8 values (because = 8/^) to 
reduce computations and minimize the program run time. After modulation, the digital 
signal is filtered with a 6th order digital Butterworth bandpass filter with the digital 
passband frequencies given by 


(0 


cl 


(/,-0-3i?) 

fs 


(5.20) 


© 


c2 


(/c + 0.3/f) 

fs 


(5.21) 


These correspond to the real frequencies 452.12 kHz and 457.88 kHz, respectively. All 

data is output to a file with floating point precision. The filter coefficients for both the BPF 

and the interpolation filter and other details are documented in the code in Appendix. For 

• ^ 

more details on the filtering operations and routines see Kraus [Ref. 12] and Embree [Ref.'^ 
13]. 


2. Demodulator/Detector 

For a given/c,/^, and associated filter parameters, the delay of the modulated signal 
through the system is fixed. Using this delay, the received signal can be demodulated 
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correctly. The additional delay through the demodulator LPFs is similarly fixed and can be 
used to correctly set the bit timing for bit retrieval from the matched filter detectors. In this 
system, with = 455 kHz and = 3640 kilosamples/s, the full system delay from 
modulator to matched filter is 452 samples, so the first 452 samples arriving at the matched 
filter are discarded since these do not represent any of the data. Since there are = 758 

samples per bit, the matched filters produce a maximum output value every 758 samples. 
Therefore every 758 input values to each matched filter are stored in a buffer, and the dot 
product of this buffer with the 758 matched filter coefficients is used to determine the 
output bit value. The dot product is the FIR filtering operation of the matched filter. A 
threshold is used such that a positive result corresponds to an output bit of 1 and a 0 
otherwise (see Figure 5.4). 

3. AWGN Channel 

Gaussian noise is generated using the Box-Muller random number generator 
algorithm [Ref 13]. The noise variance, cj^awgn> is user defined. Prior to adding the noise 
to the QPSK signal, it is filtered with the same BPF used to filter the digital QPSK signal. 
The filtered noise is added to the filtered QPSK signal prior to the demodulation routine. 

Baseband modulation over a BSC provides a simple means of analyzing system 
performance and is beneficial when the details of the modulation scheme are not critical. 
QPSK is a widely used modulation method used in a variety of links, such as satellite 
communications. A coded QPSK system using an AWGN channel model provides a 
realistic, albeit rudimentary, method for analyzing the performance of a digital system. 
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VI. RESULTS 


The preceding chapter developed the details of a digital communication system and 
the implementation method of each of the system functions. Now, the goal is to test these 
functions and to successfully integrate them, allowing data to pass through the entire 
system. Specifically, the performance of the FEC channel coders, the QPSK modulation 
system, and the CELP coder are of interest. The following two sections contain the results 
of these tests. In each case, the primary data sets were short segments (3-4 seconds) of 
speech data obtained on the internet. 

A. BINARY SYMMETRIC CHANNEL 

1. Channel Coder 

As mentioned in Chapter V, for the purposes of testing the CELP coder and 
measuring the effectiveness of the channel coders, baseband simulation using a binary 
symmetric channel (BSC) model is the best approach. Using a random data stream of 
50,000 bits, the channel encoder produced 100,000 bits which were sent over the BSC with 
different values. These values were obtained via a user input interface during 
execution of the baseband channel program and were used to determine whether an input 
bit should be corrupted. The received data stream is decoded and the resulting total error is 
the number of bit errors between the decoded and the original bits. Ideally, the effective bit 
error rate (BER) after the channel decoder should be equal to zero. 

Table 6.1 shows the {performance of both coders under the various bit error 
probabilities. As the table shows, for a channel with P^ < 0.03, both K = 5 and AT = 7 

convolutional coders fully resolve the data. For 0.03 < P^ < 0.1, both coders improve the 
bit error rate but do not reduce it to zero; the K = 7 coder performs better than the AT = 5 
coder. For P^ > 0.1, both coders exhibit performance deterioration. 
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TABLE 6.1: Convolutional code effect on 


channel 

coder 
constraint 
length, K 

effective 
after decoder 

0.001 

5 

0.0 


1 

0.0 

0.01 

5 

0.0 


7 

0.0 

0.03 

5 

0.0 


7 

0.0 

0.05 

5 

0.003 


7 

0.002 

0.07 

5 

0.018 


7 

0.011 

0.1 

5 

0.24 


7 

0.08 


The positive effects of channel coding can best be seen with image data. Figure 
6.1(a) shows the original. The image dimensions are 480 x 640 pixels with each pixbl 
represented by an eight-bit grey scale, resulting in a total of approximately 2.5 Megabits. 
The image was first sent across the BSC with a = 0.05 but without any channel 

encoding, and the result is shown in Figure 6.1(b). Finally, Figure 6.1(c) is the result of 
using aK=5 channel encoder and is clearly much improved over the previous rendition; 
however, this performance was obtained with the transmission of twice as many bits, nearly 
5 Megabits, across the BSC. Nevertheless, the purpose of the illustration is to demonstrate 
the effectiveness of the channel encoding scheme (K = 5) detailed in Chapter IV. 
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Original 



(a) 


BER = 0.05 



(c) 

Figure 6.1 Effect of using a Channel Coder 
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B. QPSK MODULATION 


1. System Performance 

Performing the QPSK simulation using the CELP coder involved the use of a 
number of executables including both the main system functions and interface ‘modules’. 
These modules convert the outputs of the main system functions into the appropriate data 
formats necessary at the inputs of the succeeding system functions. Appendix D contains 
the sequence of executables necessary to perform a QPSK simulation. 

The performance of the QPSK system is measured by its channel bit error rate (P^), 
before any channel decoding, for a given EjJNg (in dB). E^, and are calculated at the 
inputs of each matched filter. The measured value of E^, is 0.00002 Joules/bit. To determine 
the noise power spectral density, the variance of the filtered AWGN is divided by the 
system bandwidth, B = 5760 Hz. The MATLAB script ebno.m in Appendix E is used to 
calculate the Efy/N^ values for different input standard deviation, a^wGN’ values. The 
actual BER was determined by comparing the actual bit patterns before modulation with 
those obtained after detection (the bit differences in the files are counted using MATLAB) 
but prior to channel decoding. 

Figure 6.2 shows a plot of the probability of bit errors as a function of the signal-to- 
noise ratio {E^Ng). The solid line indicates theoretical performance, obtainable from Sklar 
[Ref. 2] and Freeman [Ref. 10], and the measured performance is shown by the dotted line; 
the dashed portion of the line indicates projected values. The theoretical curve is obtained 
from the following relation: 



where Q{-) is the error function. A complete derivation of equation (6.1) for a QPSK sys¬ 
tem is provided in Sklar [Ref. 2]. Although the system performance falls short of the ex- 
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pected theoretical values, the measured points follow the same shape of the curve. The mea¬ 
sured values are on average displaced from the theoretical values by about 2 to 3 dB, which 



Figure 6.2 vs (Theoretical and Measured Values) 

can be accounted for implementation loss resulting from the non-ideal frequency response 
characteristics of the channel filter. 

Table 6.2 shows the effective BER at the source decoder for several E^Nq values 
when using a channel coder (K = 7). For E^Nq > 5 dB, the channel decoder corrected all 
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of the errors while for E^/N^ < 5 dB, the resultant bitstream contained various errors which 
the channel decoder could not fully resolve. 

TABLE 6.2: BER Results using K = 7 Channel Coder for Several Ej/N^ 


EblN^idB) 

P^ (from 
Figure 6.2) 

effective 
system P^ 

11.2 

0.00001 

0 

4.8 

0.06 

0.003 

1.6 

0.1 

0.08 


C. CELP RESULTS 

CELP is a widely used speech compression/coding scheme, and its performance in 
terms of speech intelligibility is considered good for cases where there are little or no bit 
errors. The question is how well it performs under higher levels of bit error. To analyze the 
performance of the CELP algorithm, the coded speech was channel coded {K -1 coder) 
and passed across the entire QPSK system using different values. Following the 

Viterbi decoder, the data stream with bit errors was passed to the CELP decoder for final 
processing and speech reproduction. 

With any speech reproduction system, the results are subjective, but they can be 
expressed using the mean opinion score (MOS) tables [Ref. 4]. Table 6.3 contains the MOS 
values for CELP coded speech samples with different BER.For < 0.001, the decoded 

CELP output was intelligible and had good fidelity as expected; for 0.001 < < 0.05 the 

TABLE 6.3: MOS Values of CELP Coded Speech for different BER 


Pb 

MOS 

0.001 

4.2 

0.010 

3.5 

0.030 

2.7 

0.050 

2.1 
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CELP decoder output was intelligible, but there was some loss in fidelity. Finally, for 
> 0.05, the audio output rapidly degraded to being unintelligible with little fidelity. 

Because CELP provides a compression of 27:1, each bit in a coded CELP frame carries 
more information than a bit in the corresponding uncoded linear PCM bit stream. This 
means that higher BERs result in more severe distortion in the CELP decoded output than 
in the uncompressed speech 

In summary, the results clearly show the benefits of using channel coding and the 
successful implementation of the QPSK system in software. Specifically, the channel 
coders effectively reduced the BER to zero when channel bit errors are below 0.03. Poor 
performance at higher BERs is expected when implementing systems with memory 
constraints and fast speed requirements. The QPSK system performance closely paralleled 
theoretical results. The CELP algorithm provides an effective, low bit-rate speech 
representation. For moderate amounts of errors, < 0.03, the decoded speech is 
intelligible with good waveform fidelity; however, for higher error rates, the decoder 
performance is poor and unusable. t ■ 
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VII. CONCLUSIONS 


A. CONCLUSIONS 

In this thesis the details of a specific digital communication system implementation 
have been discussed. Specifically, the mechanisms necessary to use QPSK modulation and 
convolutional coding to transmit compressed speech have been examined. CELP 
compression is an efficient speech compression technique which produces low data rate 
coding while maintaining the intelligibility of the processed speech. Convolutional coding 
and Viterbi decoding are established channel coding schemes which allow safe 
transmission of data under noisy, error producing conditions. Lastly, QPSK is an efficient 
modulation scheme currently used by modem satellite communication links. 

The full implementation shows the viability of coding a digital link in software and 
its use as a simulation tool for real data. The modular nature allows for interchangeability 
of components as was shown with the use of different constraint length channel coders.’ 
However, as currently coded, the system is not flexible in accomodating different carrier 
frequencies or sampling frequencies. This is due to the lack of a PLL which would resolve 
the synchronization issues and allow for the use of different simulation frequencies. 


B. FUTURE WORK 

For the purposes of testing the link and simulating the transmission of data, this 
software implementation provides a reasonable platform for future work. Additionally, the 
benefits of channel coding and effectiveness of various modulation schemes can be 
realistically represented. As it stands, the system is mdimentary and not optimized for 
speed. Future work may consist of: 

• optimizing the C code and porting it to a DSP chip, 

• using active filters and components to simulate channel conditions, 

• incorporating the link as a subsystem of a much larger network, 

• using the link to transmit image data, and 
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• enhancing the synchronization functions of the link (PLL). 

The above improvements and enhancements would eventually result in a complex but flex¬ 
ible functional implementation of a digital communication system. By further modulariz¬ 
ing the functions, one of the big advantages of programmable DSP could be exploited, 
namely, the ability to reconfigure parts of the link quickly without the difficulties involved 
in the manipulation of hardwired systems. 
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APPENDIX A. COMPUTER CODE FOR INTERFACING THE CELP 
CODER WITH THE COMMUNICATION SYSTEM 


The CELP encoder outputs coded speech frames in hexadecimal format. These hex 
values need to be converted to unipolar binary format prior to use by the channel encoder. 
Similarly, the channel decoder output must be converted back to hex frames of 144 bits 
each (36 hex characters) prior to use by the CELP decoder. The following two scripts 
perform these conversions. 


sic 4iH«*** ****************************************************** *************** 


filename: hex2bin.C 


Function: converts 144 bit framed CELP encoded data from hex to binary. 

Input: packed hex frames (36 characters/frame = 144 bits). 

Output: unpacked unipolar binary data. 

***** sje sj; sjc sjc sjc * sic s|c * s(c *** sjc ** sjc ** :|c * s}c * sK * sic ** s|e sje sic sic sfc St: 5|c sfe 5(c *** sic sjc sjc :}c 5|c sH *************************** / 


#include <iostream.h> 

#include <iomanip.h> 

#include <fstream.h> 

#include <math.h> 

#include <stdlib.h> 

#define FRAMESIZE 36 
#define ASCII 55 

main(int argc, char *argv[]) 

{ 

/* variables: 

* nibble - single hex character from a frame 

* displayMask - picks off each bit in nibble 

* frame [] - contains all 36 characters in one CELP frame 

*/ 


unsigned short nibble, displayMask = 0; 
unsigned char frame[FRAMESIZE+l]= {0}; 

/* file i/o. 

*/ 

if (argc != 3) { 

cout « “function: h2b converts celp ‘.chan’ files from hex -> binary” 
« endl « “Usage: h2b infile outfile” « endl « endl; 
exit(l); 

} 

else { 
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ifstream celpln(argv[l], ios::in); 
if (Icelpin) { 

cerr « “Input file “ « argv[l] « “ could not be opened “« endl; 
exit(l); 

} 


ofstream binOut(argv[2], ios::out); 
if (IbinOut) { 

cerr « “Output file “ « argv[2] « “could not be opened” « endl; 
exit(l); 

} 


/* read each frame of CELP: then convert to binary*/ 

while (celpin » frame) { 

/* read in each hex character of the frame. 

*/ 

for (int c = 0; c <=35; C++) { 
nibble = (unsigned short)frame[c]; 

/* convert from ASCII representation to 4 bit binary quantity 
*/ 

if ((nibble >= ‘0’)&&(nibble <= ‘9’)) { 
nibble = nibble - (unsigned short)’O’; 

} 

else { 

nibble = nibble - ASCII; 

} 

/* mask 3 bits of nibble starting with mask = 1 0 0 0 to pick of individual 
* bits in the nibble and output the bits to a file. 

*/ 

display Mask = 1 « 3; 
for (int cl = 1; cl <= 4; cl++) { 
binOut « (nibble & displayMask ? ‘1’ : ‘0’) « “ 
displayMask »= 1; 


} 

} 

binOut.closeO; 

celpIn.closeO; 

} 

return 0; 
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filename: bin2hex.C 


Function: convert binary stream into packed hex format for CELP synthesis - 
the hex nibbles are output to a file in frames of 36 hex/frame. 

Input: file of unpacked unipolar binary channel decoded data. 

Output: packed hexadecimal data - 36 hex/row (==frame). 

#include <iostream.h> 

#include <iomanip.h> 

#include <fstream.h> 

#include <stdlib.h> 


#define 

FRAMESIZE 

36 

#define 

NTOBLESIZE 

4 

main(int argc, char *argv[]) 


1 

/* variables 


♦ 

bit 

input bit (0 or 1) 

* 

nibble 

one hex character 

* 

bitCount 

keep track of 4 bits/hex 

* 

*/ 

nibbleCount - 

keep count of 36 nibbles/frame 


unsigned short bit, nibble = 0, bitCount = 3, nibbleCount = 0; 

/***/ 

if (argc != 3) { 

cout« “function: ‘bin2hex’ converts bitstream to packed “ 

« “hex frames for celp synthesis” « endl 
« “Usage: bin2hex infile outfile” « endl; 

} 

else { 

ifstream binln(argv[l], ios::in); 
if (!binln) { 

cerr« “Input file “ « argv[l] « “ could not be opened “« endl; 
exit(l); 

} 

ofstream hexOut(argv[2], ios::out); 
if (IhexOut) { 

cerr « “Output file “ « argv[2] « “ could not be opened” « endl; 
exit(l); 

} 


/* read in each bit (0 or 1). 
*/ 


59 



while (binin » bit) { 


nibble = nibble + (bit « bitCount); 

/* convert each nibble (= 4 bits) into its hex value, 
*/ 

if ((bitCount % NIBBLESIZE) == 0) { 
hexOut« hex « nibble; 
nibble = 0; 
bitCount = 4; 
nibbleCount+-f; 

if (nibbleCount == FRAMESIZE) { 
hexOut« endl; 
nibbleCount = 0; 


bitCount-; 

} 


/* if there weren’t enough bits at the end of the file to make a nibble, 
* convert whatever is left to hex and output. 

*/ 

if (bitCount != 3) { 
hexOut « hex « nibble; 
nibbleCount++; 

} 


/* if there were not enough nibbles (36) for a frame, output hex F to fill 
* the frame and exit. 

*/ 

if (nibbleCount != 0) { 

for (int c = nibbleCount; c < FRAMESIZE; C++) 
hexOut « hex « 15; 

} 

hexOut.closeO; 

binIn.closeO; 

} 

return 0; 

} 
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APPENDIX B. COMPUTER CODE FOR CONVOLUTIONAL 
CODERS AND VITERBI DECODERS 


This appendix contains the C++ code for the /if = 7 convolutional coders along with 
the code for iht K = 1 Viterbi algorithm and associated header files. The K = 5 
convolutional coderA^iterbi coder routines are similar to the ^ = 7 version with a few slight 
modifications. 


filename; fec7enc.C 

K=7, r=l/2 Convolutional Coder 

Function: channel codes input binary data using K=7, r = (2,1) convolutional code. 


Input: file of unpacked unipolar binary data (Os and Is only! with spaces in between them). 
Output: file of unpacked unipolar binary data. 


#include <iostream.h> 
#include <fstream-h> 
#include <stdlib.h> 


#define K 7 

#define RATE 2 


void convoIve(unsigned short, unsigned short[]); 

main(int argc, char *argv[]) 

{ 


/* variables: 

* bit -> input 0 or 1 bit 

* codeSymbol -> array of length 2 for the output code symbols 
*/ 

unsigned short bit; 

unsigned short codeSymbol[RATE] = {0}; 

/* read in command line parameters. 

*/ 

if (argc != 3) { 

cout« “function: ‘fec7enc’ codes binary data with K=7 (2,1) “ 

« “conv. coder” « endl 
« “Usage: fec7enc infile outfile” « endl; 


else { 

ifstream binln(argv[l], ios::in); 
if (Ibinln) { 

cerr « “Input file “ « argvfl] « “ could not be opened” « endl; 
exit(l); 

} 


61 


ofstream fecOut(argv[2], ios::out); 
if (IfecOut) { 

cerr « “Output file “ « argv[2] « “ could not be opened “« endi; 
exit(i); 




/* read in one bit (must be unipolar!! -> a 0 or 1) at a time and compute the 

* two output code symbols. Output to file in unpacked form - ie. a space 

* between values. 

*/ 

while (binin » bit) { 
convolve(bit,codeSymbol); 

fecOut« codeSymboi[0] « “ “ « codeSymbol[l] « “ 

} 


/* flush the register with K > 1 = 6 zeros. 

*! 

for (unsigned short flush = 1; flush < K; flush++) { 
convolve(flush,codeSymbol); 

fecOut« codeSymbol[0] « “ “ « codeSymbol[l] « 

} 


fecOut.closeO; 

binIn.closeO; 

} 

return 0; 

} 


/* function computes the two output binary symbols using 
*/ 

void convolve(unsigned short inBit, unsigned short u[]) 

{ 

/* variables; 

* shift_reg -> stores the 7 bits in the register 

^ and values must be kept between function calls 

* same -> sum of bits in the register which are used 

* to compute both codesymbols 
*/ 

static unsigned short shift_reg[K] = {0}; 
unsigned short same; 

for (int n = (K-1); n >=1; n-) { 
shift_reg[n] = shift_reg[n-l]; 

} 


shift_reg[0] = inBit; 

/* code vectors used to determine code bits ul, u2 using modulo 2 addition: * 

* (non-systematic): vl => 1 00 1 1 1 i 
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v2 => n 0110 1 


*/ 

same = shift_reg[0] + shift_reg[3] + shift_reg[4] + shift_reg[6]; 

/* for every input bit, 2 output bits are generated. 

*/ 

u[0] = (same + shift_reg[5]) % 2; 
u[l] = (same + shift_reg[l]) % 2; 




filename: viterbi.C 

Viterbi Decoder for K = 7, r = 1/2 convolutional codes 


Function: implements a Viterbi decoder for an input stream of unpacked unipolar 
binary data 

Input: file of unipolar binary data - representing demodulated/detected data stream. 
Output: file of unpacked unipolar binary data - estimate of information sequence 


Associated header files: 


#include <iostream.h> 
#include <fstream.h> 
#inciude <stdlib.h> 


#include “vit7_consts.h” 

unsigned short states[NOS*2][4] = { 
#include “vit7hamwts.h” 

}; 


#include “vit_functions.h” 

main(short int argc, char *argv[]) 

{ 


/* variables: 

* c - index arrays for appropriate 2 bit input 

* ul, u2 • received 2 bit code symbols 

* input - 
*f 


unsigned short c, ul, u2, input = 0; 


/* read command line parameters 
*/ 


if (argc != 3) { 

cout« “function: ‘viterbi7’ decodes a K = 7, r =1/2 conv. coded 
« “bitstream” « endl 
« “Usage: viterbi7 infile outfile” « endl; 
exit(l); 

} 

else { 
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ifstream vitln(argv[l], ios:;in); 
if (!vitln) { 

cerr « “Input file “ « argv[l] « “ could not be opened” « endl; 
exit(l); 

} 

ofstream vitOut(argv[2], ios::out); 
if (IvitOut) { 

ccrr « “Output file “ « argv[2] « “ could not be opened” « endl; 
exit(l); 

} 


/* initialize and allocate memory for all NOS path histories - this is a C++ 

* construct (use malloc/delete for C implementation. 

*/ 

for (int p = 0; p <= (NOS - 1); p++) { 
pathHist[p] = new unsigned short[HISTORY]; 
newPath[p] = new unsigned short[HISTORY]; 

} 

/* recover a pair of code symbols and determine index, c, from inputs ul, u2. 

*/ 

while (vitin » ul) { 
vitin » u2; 
c = 2*ul + u2; 

/* index Hamming distances for each pair of branches into each node, add 

* to respective source distance totals, and select one. (this doesn’t make 

* much sense does it?) 

*/ 

findNewPath(c,input); 

updatePathHist(input); 

input++; 

/* compare all of the paths and decode them after every CHECKPATH new branches. 
*/ 

if ((input % CHECKPATH) == 0) { 
input = decodeTrellis(input, vitOut); 


} 

/* when decoding finished, de>allocate and clear memory requirements - once 
* again, these are C++ commands. 

*1 

for (p = 0; p <= (NOS -1); p++) { 
pathHist[p] = delete[]; 
newPath[p] = deletc[]; 

} 


vitIn.closeO; 
vitOut.closeO; 
return 0; 

} 
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/* function decodes trellis path to a bit pattern after the paths have 

* converged some distance behind the current iteration. 

* arguments passed to function: 

int decodeTrellis(int input, ofstream &toFile) 

{ 

/* variables: 

* flagA - set if only one node is common (ie. the first one) in 

* the paths and ensures it is not ‘lost’ between calls 

* counter - tracks the number of common nodes 

* flagB - ‘0’ indicates paths have converged at least at one node 

* hansel > common node used for bit identification 
*/ 


static short flagA = 0; 

unsigned short counter = 0, flagB = 1, hansel; 

/* check for convergence in path histories. 

*/ 

while (flagB == 1) { 

/* starting with the first node, check to see if any of the paths don’t 

* begin w/ the same one. If they don’t, then the paths haven’t converged 

* yet so set flagB and leave this function call. 

*/ 

for (unsigned short n = 0; n < (NOS - 1); n++) { 

if (*(pathHist[n] + counter) != *(pathHist[n+l] + counter)) { 
flagB = 0; 


} 

/* if a node is similar amongst all of the paths, then the paths have 
* converged in at least one location...check for more. 

*/ 

if (flagB ==1) 
counter++; 

} 


/* if there are any path similarities starting from the beginning (among !all! * 
*l 

if (counter > 0) { 

for (unsigned short p = 0; p < counter; p++) { 
hansel = *(pathHist[0] + p); 

/* output the bits to output file. Each bit is determined by the branch 

* between two nodes...if there is only one common node, a bit can’t be 

* decoded (this is why flagA is set to ‘ T below) 

*/ 

if ((p> 0)11 (flagA ==!)){ 
if (hansel <= (NOS/4* 1)) { 


* the paths) then decode them. 
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toFile « 0 « ‘ 

) else if ((hansel >= NOS/2) && (hansel < (3*NOS/4))) | 
toFile « 0 « ‘ 

} else { 

toFile « i « ‘ 

} 


/* set flagA to ‘r to indicate only one node is in common and must be used 
* dufing the next function call to decode a bit. 

*/ 

else { 
flagA = 1; 


i 

/* update all paths: reset pointers to new starting nodes and clear previous 
* decoded history. 

*/ 

for (n = 0; n <= (NOS - 1); n++) { 

for (int p = counter; p < input; p++) { 

*(newPath[n] + p - counter) = *(pathHist[n] + p); 

*(pathHist[n] + p ~ counter) = *(newPath[n] + p - counter); 

} 

) 

/* if a there was path convergence, return this... 

*/ 

return (input - counter); 

} 


/* otherwise return this... 
*/ 

return input; 

} 


filename; vit_functions.h 

contents: Function prototypes and definitions for Viterbi r = 1/2 decoder. 

************* ****s{:*:<c**********************:J:** He*** ******************:{: *****,j;y 


void findNewPath(int, int); 

void updatePathHist(int); 

int decodeTrellis(int, of stream &); 


/* findNewPath: finds the shorter of the two paths (using the Hamming distance) 

* between each state and its two source states. Then the total 

* path weights are updated for the current time step. 

*/ 

void findNewPath(int c, int input) 

{ 
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unsigned short minValue, index; 
unsigned short stateTempWts[2] = {0}; 

for (int n = 0; n <= (NOS - 1); n++) { 

/* calculate both Hamming weights for a node during an interation and 

* eliminate the larger one. 

*f 

stateTempWts[0] = states[n * 2][c] + pathWts[source[0][n]]; 
stateTempWts[l] = states[(n*2)+l][c] + pathWts[source[l][n]]; 

if (stateTempWts[0] <= stateTempWts[l]) { 
minValue = stateTempWts[0]; 
index = 0; 

} 

else { 

minValue = stateTempWts[l]; 
index = 1; 

} 

/* update the path metric for the node based on the shorter path found 

* above. 

*/ 

new Path Wts[n] = stateTempWts[index]; 

for (int p = 0; p < input; p++) { 

*(newPath[n] + p) = *(pathHist[source[index][n]] + p); 

} 

*(newPath[n] + input) = source[index][n]; 

} 

} 


/* updatePathHist: the new path of each of the states at time t is 
* the path histories for the next time, t+1. 

*/ 

void updatePathHist(int input) 

{ 

for (int n = 0; n <= (NOS -1); n++) { 
for (int p = 0; p <= input; p++) { 

*(pathHist[n] + p) = *(newPath[n] + p); 

} 

pathWts[n] = newPathWts[n]; 

} 

} 


filename: vit7_consts.h 


contents: constants, transition state values, and global variables of 


Viterbi decoder for K = 7, r = 1/2 coder. 

******************************************************* ****** *********4c**y 


#define NOS 64 /* number Of states - of encoder */ 
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#define HISTORY 40 /* retain this much path history through the 

trellis for each state ♦/ 

#define CHECKPATH 15 /* check for similarity between paths once every 

CHECKPATH timesteps */ 


/* source states for each of the 64 states - each state has only 2 possible 

* states of origin. Each column.i, represents the 2 source nodes for the 

* ‘i’th node. 

*/ 


unsigned short source[2][NOS] = 

{{0, 2,4, 6, 8,10, 12, 14, 16, 18, 20, 22, 24, 26,28, 30, 

0, 2,4, 6, 8, 10, 12, 14, 16, 18,20, 22,24, 26, 28,30, 

1,3, 5, 7, 9, 11, 13, 15. 17, 19,21,23, 25,27, 29,31, 

1,3, 5,7,9, 11, 13, 15, 17, 19,21,23, 25,27,29,31), 

(32, 34, 36, 38. 40, 42, 44, 46.48, 50, 52, 54, 56, 58, 60, 62, 

32, 34,36, 38,40,42,44,46,48, 50, 52, 54,56, 58, 60, 62, 

33, 35,37, 39,41,43,45,47,49, 51, 53, 55,57,59,61, 63, 

33,35,37, 39,41,43,45,47,49, 51,53, 55,57, 59,61, 63,)); 


unsigned short pathWts[NOS] = {0}, newPathWts[NOS] = {0}, 
*pathHist[NOS], *newPath[NOS]; 


^:*;*:^:i=:k*********Ji<******4:****** ****** ***:<«* *********si«*******st:***********sls*4!*sit=l«i|ss^^ 

filename: vitThamwts.h 

contents: the Hamming weights between the 4 possible input bit pairs (00, 01, 

10, and 11) and the correct bit pattern produced by the transition 
between a source node and the destination node. 

****5ic*;i«:4:H«*****************:i:***** + :je>ls>f;5f::je,fc*sic**=i:si«sk**************=f:*sk***********Jk**/ 

{ 0 , 1 , 1 , 2 },{ 2 , 1 , 1 , 0 }, 

{2, I, 1,0},{0, 1, 1,2}, 

{ 2 , 1 , 1 , 0 },{ 0 , 1 , 1 , 2 }, 

{ 0 , 1 , 1 , 2 },{ 2 , 1 , 1 , 0 }, 

{ 0 , 1 , 1 , 2 },{ 2 , 1 , 1 , 0 }, 

{2, 1, 1,0},{0, 1, 1,2}, 

{ 2 , 1 , 1 , 0 },{ 0 , 1 , 1 , 2 }, 

{0, 1, 1,2},{2, 1, 1,0}, 

{ 1 , 0 , 2 , 1 },{ 1 , 2 , 0 , 1 }, 

( 1 , 2 , 0 , 1 },{ 1 , 0 , 2 , 1 }, 

(1,2, 0, 1},(1,0, 2, 1}, 

(1,0, 2, 1},{1,2, 0, 1}, 

(1,0, 2, 1},{1,2, 0, 1}, 

( 1 , 2 , 0 , 1 },{ 1 , 0 , 2 , 1 }, 

( 1 , 2 , 0 , 1 },{ 1 , 0 , 2 , 1 }, 

( 1 , 0 , 2 , 1 },{ 1 , 2 , 0 , 1 }, 

(2, 1, 1,0},(0, I, 1,2}, 

(0, I, 1,2},(2, 1, 1,0}, 

(0, 1, 1,2},(2, 1, 1,0}, 

( 2 , 1 , 1 , 0 },{ 0 , 1 , 1 , 2 }, 

(2, 1, 1,0},(0, 1, 1,2}, 

( 0 , 1 , 1 , 2 },( 2 , 1 , 1 , 0 }, 

( 0 , 1 , 1 , 2 },( 2 , 1 , 1 , 0 }, 
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{ 2 , 1 , 1 , 0 },{ 0 , 1 , 1 , 2 ), 
{ 1 , 2 , 0 , 1 },( 1 , 0 , 2 , 1 }, 
{ 1 , 0 , 2 , !},{!, 2 , 0 , 1 }, 
{ 1 , 0 , 2 , 1 },{ 1 , 2 , 0 , 1 }, 
{ 1 , 2 , 0 ,!},{!, 0 , 2 , 1 }, 
{ 1 , 2 , 0 , 1 },{ 1 , 0 , 2 , 1 }, 
{ 1 , 0 , 2 , !},{!, 2 , 0 , 1 }, 

{ 1 , 0 , 2 , !},{!, 2 , 0 , 1 }, 

{ 1 , 2 , 0 , 1 },{ 1 , 0 , 2 , 1 }, 

{ 1 , 2 , 0 , !},{!, 0 , 2 , 1 }, 

{ 1 , 0 , 2 , 1 },{ 1 , 2 , 0 , 1 }, 
{ 1 , 0 , 2 , !},{!, 2 , 0 , 1 }, 

{ 1 , 2 , 0 , !},{!, 0 , 2 , 1 }, 

{ 1 , 2 , 0 , 1 },{ 1 , 0 , 2 , 1 }, 

{ 1 , 0 , 2 , !},{!, 2 , 0 , 1 }, 

{ 1 , 0 , 2 , !},{!, 2 , 0 , 1 }, 

{ 1 , 2 , 0 , !},{!, 0 , 2 , 1 }, 

{ 2 , 1 , 1 , 0 },{ 0 , 1 , 1 , 2 }, 

{ 0 , 1 , 1 , 2 },{ 2 , 1 , 1 , 0 }, 

{ 0 , 1 , 1 , 2 },{ 2 , 1 , 1 , 0 ), 

{ 2 , 1 , 1 , 0 },{ 0 , 1 , 1 , 2 }, 

{ 2 , 1 , 1 , 0 },{ 0 , 1 , 1 , 2 }, 

{ 0 , 1 , 1 , 2 },{ 2 , 1 , 1 , 0 }, 

{ 0 , 1 , 1 , 2 },{ 2 , 1 , 1 , 0 }, 

{ 2 , 1 , 1 , 0 },{ 0 , 1 , 1 , 2 }, 

{ 1 , 0 , 2 , !},{!, 2 , 0 , 1 }, 

{ 1 , 2 , 0 , !},{!, 0 , 2 , 1 }, 

{ 1 , 2 , 0 , 1 },{ 1 , 0 . 2 , 1 }, 

{ 1 , 0 , 2 , !},{!, 2 , 0 , 1 }, 

{ 1 , 0 , 2 ,!},{!, 2 , 0 , 1 }, 
{ 1 , 2 , 0 ,!},{ 1 , 0 , 2 , 1 }, 
{ 1 , 2 , 0 , 1 },{ 1 , 0 , 2 , 1 }, 
{ 1 , 0 , 2 , 1 },{ 1 , 2 , 0 , 1 }, 
{ 0 , 1 , 1 , 2 },{ 2 , 1 , 1 , 0 ), 

{ 2 , 1 , 1 , 0 },{ 0 , 1 , 1 , 2 }, 

{ 2 , 1 , 1 , 0 },{ 0 , 1 , 1 , 2 }, 

{ 0 , 1 , 1 , 2 },{ 2 , 1 , 1 , 0 }, 

{ 0 , 1 , 1 , 2 },{ 2 , 1 , 1 , 0 }, 

{ 2 , 1 , 1 , 0 },{ 0 , 1 , 1 , 2 }, 

{ 2 , 1 , 1 , 0 },{ 0 , 1 , 1 , 2 }, 
{ 0 , 1 , 1 , 2 },{ 2 , 1 , 1 , 0 } 
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APPENDIX C. COMPUTER CODE FOR BASEBAND MODULATOR 

AND QPSK SYSTEM 


The first program below implements a simple baseband modulator with a BSC 
channel. The code following, is the main routine and header files of the full QPSK system. 
In the program qpsk.C, the header files containing the filter coefficients of the demodulator 
LPF and the matched filter are not included due to their length. 

4 :** ****************** ***********3i«*5i«5j:**sti*5ic*Jlc*ii:******5}c*5C**************:l«****** 

filename: baseband.C 

Baseband Modulator using BSC 

Function: implement a simple baseband modulator across BSC channel by 
applying user defined Pb to each bit 


Input: read in a binary file - unipolar unpacked binary 
Output: unipolar unpacked binary to a file 

**************** ********************************5j.*:i;:(;*4::i:5{.;jj^5jc5|c:(.jU:j;:j.^5|j;^jj.5^5^5^^j,jj^^j,;^y 


#include <iostream.h> 

#include <iomanip.h> 

#include <fstream.h> 

#include <stdlib.h> 

#ifndefRANDMAX 
#define RANDMAX 32767 
#endif 

main(int argc, char *argv[]) 

{ 

/* variables: 

* bit - input bit 

* newBit - output bit 

* errorMask - used to ‘flip’ a bit if an error occurs 

* chance - calculate if input bit will undergo ‘error’ 

* p_error - user input Pb 

*/ 

unsigned short bit, newBit 
errorMask=0; 
float chance, p_error = 0.0; 

/* file i/o 
*/ 

if (argc != 3) { 

cout « “function: ‘baseband’ simulates a simple baseband channel “ 
« “for any binary data” « endl 
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« “Usage: baseband infile outfile” « endl « end!; 
exit(l); 

} 

else { 

ifstream channelln(argv[l], ios::in); 
if (!channelln) ( 

cerr « “Input file “ « argv[l] « “could not be opened “« endl; 
exit(l); 

} 

ofstream bbOut(argv[2], ios::out); 
if (IbbOut) { 

cerr « “Output file “ « argv[2] « “could not be opened” « endl; 
exit(l); 

} 


cout« “\nEnter the probability of bit error => “; 
cin » p_error; 

/* for each input bit, calculate a chance and cause an error if chanc <= Pb. 
*/ 

while (channelln » bit) { 
chance = (rand()/(float)RANDMAX) + p_error; 
errorMask = (unsigned short)floor(chance); 

/* perform bit level XOR operation to change the bit if an errors as 
* determined above. 

*/ 

if (errorMask == 1) { 
bit = bit ^ errorMask; 

} 


bbOut « bit« “ 

} 


bbOut.closeO; 
channelln.closeO; 
return 0; 

} 


:tc **** Jfc * St: ^ si- *** ^ *** *** **** sfs * ^ ^ sK * * 

filename: qpsk.C 

Digital QPSK System 

Function: modulates, adds noise, and demodulates a data stream using QPSK 
modulation 
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Input: file (not packed) of unipolar binary data (0 or 1 only). 
Output: unpacked demodulated and detected data stream 

Associated header files: qpskmod.h, demod.h, noise.h, bpf.h 


#include <:iostream.h> 
#include <fstream.h> 
#include <stdlib,h> 
#include <math.h> 

#include “qpskmod.h” 
#include “demod.h” 
#include “noise.h” 
#include “bpf.h” 

double dlpf[N] = { 
#include “dlpf.h” 

}; 


double matched[NPB] = { 
#include “mfcoeffs.h” 

}; 


main(int argc, char *argv[]) 

{ 


/* variables 


Ibit, Qbit 

nPerBaud 

index 

delay 

m 

normalize 


inphase/quadrature bits (modulate cos/sin) 
number of samples per baud 
access sinusoid lookup table 
signal delay through the system 
NPB (samples^aud) 

scale sinusoids by l/(sqrt2) so sum == 1 
inPhase/quadPhase - inphase/qphase sinusoids 
qpsk/qpskfilt - original/filtered qpsk signals 
scale - std dev of AWGN 
noise - the AWGN sample (prior to filtering) 
rcvd - s[n] + noise[n] at demodulator input 
demodl/demodQ 

Imfout/Qmfout - output from the two matched filters 
rl[], rQ[] - input buffers to I/Q demodulator LPFs 
Imfin[], QmfinC - input buffers to the I/Q matched filters 


short Ibit, Qbit; 
unsigned short nPerBaud, 
index, n, 


delay = DELAY, m = NPB; 
double normalize, 

inPhase, quadPhase, 
qpsk, qpskfilt, 


*/ 
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scale, noise, 
rcvd, 

demodi = 0.0, demodQ = 0.0, 

Imfout = 0.0, Qmfout = 0.0; 
double rI[N] = {0}, rQ[N] = {0}, 

Imfin[NPB] = {0}, Qmfin[NPB] = {0); 


/* Open input and output files */ 
if (argc != 3) { 

cout« “function: ‘qpsk’ - QPSK simulation w/ awgn channel” « endl 
« “Usage: qpskmod infile outfile” « endl « endl; 
exit(l); 

} 

else { 

ifstream dataln(argv[l], ios::in); 
if (idatain) { 

cerr « “Input file “ « argv[l] « “could not be opened “« endl; 
exit(l); 

} 

ofstream qpskS(argv[2], ios::out); 
if (IqpskS) { 

cerr « “Output file “ « argv[2] « “could not be opened” « endl; 
exit(l); 

} 


cout « “Enter the std of the noise: “; 
cin » scale; 
cout« endl; 


normalize = l/sqrt(2); 
index = 1; 

/* input the two bits per QPSK symbol. 

*/ 

while (datain »Ibit) { 
datain » Qbit; 

/* convert to bipolar bit stream */ 

Ibit = 2*Ibit- 1; 

Qbit = 2*Qbit-1; 

/* for NPB samples per baud, use the same Ibit and Qbit to modulate the qpsk 
* signal. 

*/ 

n = 1; 
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while (n <= NPB) { 


/* sinusoids are accessed from lookup table of 8 values (fs = 8*fc). 

*/ 

inPhase = Ibit * cosLookUp[(index + 1) % Nyq]; 
quadPhase = Qbit * cosLookUp[(index - 1) % Nyq]; 
qpsk = normalize * (inPhase + quadPhase); 

/* generate an AWGN sample, add it to the QPSK value and filter both - they can 

* both be filtered separately and summed to ‘more clearly’ denote that the 

* noise is filtered prior to addition but this is faster (and since it is 

* a linear operation, it produces the same result. 

*/ 

noise = scale*gaussian(); 
rcvd = filter(qpsk + noise); 

/* INSERT interpolation and decimation routines here to more realistically 

* simulate an analog signal, 

*/ 

/* demodulate the ‘received’ signal. 

*/ 

/* INSERT phase locked loop routine here (as a do-while loop) to simulate 

* more realistic signal recovery. 

*/ 

rl[0] = rcvd * cosLookUp [(index + 1) % Nyq]; 
rQ[0] = rcvd * cosLookUp[(index - 1) % Nyq]; 

demodi = rI[(N-l)/2]*dlpf[(N-l)/2]; 
demodQ = rQ[(N-l)/2]*dlpf[(N-l)/2]; 

for (unsigned short int tap = 0; tap < (N - l)/2 ; tap++) { 
demodi += (rl[tap] + rI[N-tap-l]) * dlpf[tap]; 
demodQ += (rQ[tap] + rQ[N-tap-l]) * dlpf[tap]; 

} 


for (unsigned short update = (N -1); update > 0; update—) { 
rl[update] = rl[update -1]; 
rQ [update] = rQ [update - 1]; 

} 

if (delay > 0) { 
delay-; 


/* used matched filters to ‘detect’ the I and Q bits. Interleave and output 
* the results to the output file. 

*/ 

else { 
m-; 

Imfout += matched[m]*demodI; 
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Qmfout += matched[m]*demodQ; 


if (m == 0) { 
if (Imfout > 0) { 
qpskS « 1 « ‘ 

] 

else { 

qpskS « 0 « ‘ 

} 

if (Qmfout > 0) { 
qpskS « 1 « ' 

} 

else { 

qpskS « 0 « ‘ 

) 


Imfout = 0.0; 
Qmfout = 0.0; 
m = NPB; 

} 


if (index == Nyq) { 
index = 0; 


index++; 

n++; 

} 

} 

/* at the end of the data stream if there are values left in the matched filter 
* then output two zeros as the last symbol. 

y 

if (m != NPB) { 
qpskS « 0 « ‘ ‘ « 0; 

} 


qpskS .closeO; 
datalnxloseO; 
return 0; 

} 

} 


filename: qpskmod.h 

contents: QPSK modulation/demodulation parameters 

*****************He*******************************************He******* ********/ 
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#define 

D 4800 /* symbol rate for data rate of 9600 bps */ 

#define 

fc 455000 

/* intermediate freq */ 

#define 

fs 3640000 

oo 

II 

* 

#define 

Nyq 8 

/* sampling rate */ 

#define 

DELAY 452 

/* ! the drawback of not having a PLL ! */ 

#define 

NPB 758 

/* round(fs/D) */ 

/* digital 

filter parameters; 


* 

type; Butterworth BPF 

* 

passband: fc +- 

0.3*R==> 452.12 kHz < pass <457.88 Hz 

* 

order: 6th 



coefficients: see below (aO at the top, a6 at the bottom) 

*/ 



#define 

TAPS 7 



double a[TAPS] = 

{ 

1.00000000000000, 

-4.22863200584110, 

8.94059042311700, 

-11.20184667623209, 

8.88152415382525, 

-4.17294333434056, 

0.98031108137184 

}, 

b[TAPS] = 

{ 

0.00000012164818, 

0 . 0 , 

-0.00000036494454, 

0 . 0 , 

0.00000036494453, 

0 . 0 , 

-0.00000012164818 


cosLookUp[Nyq] = 

{ 

0.00000000000000, 

0.70710678118655, 

1.00000000000000, 

0.70710678118655, 

0.00000000000000, 

-0.70710678118655, 

-1.00000000000000, 

-0.70710678118655 

}; 
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/* function prototype for BPF filter function. 
*/ 

double filter(double); 


y:*: if: * :jc ^ * li; ^ jf; *** ^ sis *:{:***** 4 : ^ * Jf: :4c sf:* :1c ^ ^ ^ 

filename: demod.h 

contents: function prototypes for noise generator and constant definition of 
demodulator LPF 

#define N 37 /* number of taps of demodulator LPF */ 

double gaussianO; 
double noiseFilter(double); 


jfcilc if:* if: 4:*** ****:f5 *****************:!<************** if: :1c 

filename: bpf.h 

contents: filter definition of BPF used on qpsk signal and AWGN 

**************i}:*********;is;fc*****************if;******;ic*:};*s}:^5jc*****************5f;:issfsy 


/* filter - a Butterworth BPF channel filter. For qpsk it removes spectral 

* sidelobes; and acts as noise filter, 

*/ 

double filter(double input) 

{ 

/* variables; 

* yHist[], xHist[] - retain input/ouptut filter history 
*/ 

static double yHist[TAPS] = {0}, 
xHist[TAPS]= {0}; 

double output = 0.0; 

xHist[0] = input; 
yHist[0] = b[0]*xHist[0]; 

for (unsigned short int i = 1; i < TAPS; i++) { 
yHist[0] = yHist[0] + b[i]*xHist[i] - a[i]*yHist[i]; 

} 

output = yHist[0]; 

/* update filter histories after each step, 

*/ 

for (unsigned short update = (TAPS > 1); update > 0; update--) { 
xHist[update] = xHist[update - 1]; 
yHist[update] = yHist[update - 1]; 
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return output; 


} 

filename: noise.h 

contents: function definitions used to generate AWGN 

NOTE: this source code obtained from Embree [Ref. 13] 

/* gaussian - generates zero mean unit variance Gaussian random variables. 

* Uses Box-Muller transformation of two uniform r.v. 

*/ 

double gaussianO 

{ 

static int ready = 0; /* flag to indicate stored value */ 

static double gstore; /* place to store other value */ 
double vl, v2, r, fac, gaus; 

double uniformO; 

/* make two numbers if none stored. 

*l 

if (ready == 0) { 
do { 

vl = 2.*uniform(); 
v2 = 2.*uniform(); 
r = vl*vl + v2*v2; 

} while(r >1.0); /* for radius less than 1 */ 

/* remap vl and v2 to Gaussian numbers. 

*/ 

fac = sqrt(-2.*log(r)/r); 

gstore = vl*fac; /* store one of the two */ 

gaus = v2*fac; /* return the other one */ 

ready = 1; /* set ready flag */ 

} 

else { 

ready = 0; /* reset ready flag for next pair */ 

gaus = gstore; /* return the stored one */ 

} 

return gaus; 

} 

/* uniform - generates zero mean uniform rv from -0.5 to 0.5. This function 

* is called by gaussian(). 

*/ 
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#ifndefRAND_MAX 
#define RAND.MAX 32767 
#endif 

double uniformO 

{ 

return ((double)(rand() & RAND_MAX)/RAND_MAX - 0.5); 

} 
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APPENDIX D. SEQUENCE OF COMMANDS TO RUN QPSK 
SIMULATION ON CELP CODED SPEECH 


The following list of commands must be run from a shell tool on a SUN 
SPARCstation in the order presented. Although the commands are given for CELP coded 
data, any data that is in unipolar binary format can be run through the channel encoders/ 
decoders and QPSK system. Below, the input file is assumed to be an audio file in the SUN 
\au’ format. The shell command line arguments are in bold followed by a brief description 
of the function. For commands with two filename arguments, the first file is the input and 
the second is the output. Except as noted, the filename suffixes can be anything that 
clarifies the file content to the user - the ones shown are merely representative and used for 
clarity. 


1) format_enc -3 <fUename.m\fovm 2 it_dec -3 -1 >filename.s^d - this function converts the 
8 bit |i-law audio file to 16 bit linear PCM using the CCITT G.723 24kbps ADPCM coder as the conversion 
interface. The ‘.spd’ suffix on the output file is required for the following CELP encoder. Note that there are 
no spaces between the {<,>} symbols and the filenames. 

2) celp_encode -i filename.s^d -o filename! - when the coder executes, it generates a Mog’ 
file, which contains detailed information on the performance of the coder during this run, and three decoded 
files using different types of output filters all of which have an ‘.spd’ filename suffix - these four files can be 
discarded. It also produces a channel file, filename!,chan for intermediate processing. This is the file of 
interest. It is in packed hexadecimal format with 36 characters/line (frame). 

3) hex2bin filenamexhan filename.hin - converts each hex character in the input file into its 4 
bit binary form and outputs the results to a file with a single space between each binary value. 

4) fecTenc fUename.hin filename.Kl - applies a = 7, r = 1/2 convolutional code to the input 
data and outputs the coded sequence to a file. The last bit in the input sequence is flushed through the register 
with 6 zeros to fully involve it in the encoding procedure. AK = 5 coder may be substituted her: the command 
is fecSenc followed by equivalent filename notation. 

5) qpsk filename.Kl filename. - converts the input unipolar {0,1} data into bipolar (-1,1} 
data and performs QPSK simulation. The data isfirst modulated aifi = 455 kHz using anfi = 3640 Hz, then 
filtered and summed with bandlimited AWGN. The user enters the standard deviation of the AWGN, 
GawgN’ ^ser prompt. Demodulation is performed with two 36th order FIR LPFs (one each for the I 
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and Q streams). Detection is by means of two matched filters of length 758 (= number of samples per baud). 
All filter coefficients are in header files and are valid only for the given carrier frequency and simulation 
frequency. All intermediate values are in the simulation use double floating point precision - output data to 
file is in unipolar binary format. 

Instead of performing a QPSK simulation, a baseband simulation across a BSC can be performed by 
using the command baseband followed by the input and output files. 

6) viterbi? filename.filename.yl - performs hard decision Viterbi decoding on K =7, r = 
1/2 convolutional codes. If a /iT = 5 coder is used, viterbiS must be used to decode the sequence. 

7) bin2hex filename.yl filenamel.chan - converts unpacked unipolar data back to packed hex 
format for the CELP decoder. The data is rearranged into lines of 36 characters per line (corresponding to 
one coded CELP frame). 

8) ceIp_decode -c filename.chm ~o filename! - decodes the input channel file into three output 
files. Two of these are filtered versions and will have the suffixes y?/ertam€2npf.spd Midfilenamelh^is^d - 
these can be discarded. The third Tilt filename!, spd is the file of interest and it is in 16 bit linear PCM format. 

9) format_enc -3 -1 <^/^nai»e.spd I forinat_dec -3 >filename.r.SLn - converts the 16 bit linear 
PCM file to 8 bit |i-law compressed data (without a header). The ‘.r’ in the filename is used to indicate that 
it represents received information. The user should used different filenames to avoid overwriting the original 
file. 

10) raw2audio filename.r.sax - appends SUN audio file header so it can be recognized and piayed 
by a SPARCstation. 

Any data in unipolar binary format can be input to the system at step 4 and accessed again after step 
6 but the data requires source coding/decoding software and interface software as necessary to make the 
information suitable for the simulation. 
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APPENDIX E. CALCULATION OF E^/No 


The following MATLAB script computes the parameter in dB and 

corresponding for a given AWGN noise variance. The measurements are taken at the 

input to the matched filter detectors. 

% 

% filename: ebno.m 

% function: calculates Eb/No for different noise variances - 
% change ‘scale_noise’ value to measure for different noise levels. 

% 

clear 

fc = 455000; % carrier freq. 

fs = 8 * fc; % sampling frequency 

BR = 9600; % bit rate into modulator (= 4800 baud) 

W = ([-0.3*BR 0.3*BR] + fc)/(fs/2); % channel passband is fc +- 0.3xBitrate 

[bl,al] = butter(3,W); % design Butterworth filter (6th order) 

points = 200000; % test points 

scale_noise = 7; % variance = scale_noise ^ 2 (AWGN power) 

noise = scale_noise*randn(l,points); 

channelNoise = filter(bl,al,noise); % bandpass filter the AWGN 

b = firl(36,fc/(fs/2)); % demodulator LPF 

detectedNoise = filter(b,l,channelNoise); % the output of this filter is 

% noise value at the matched filter 
% input, 

B = 5760; % bandwidth of channel 

No = std(detectedNoise)^2/B; % average power of noise at detector 

Eb = 0.00002; % energy per bit (Joules) calculated over 

% 500000 points on I and Q channels 

EbNo = 10*logl0(Eb/No); % dB value of Eb/No 

Pb = q(sqrt(2*Eb/No)); % Pb = Q(sqrt(2*Eb/No)) 
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